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TUMOR NECROSIS FACTOR-a AND -J RECEPTORS 
BACKGROUND OF THE INVENTION 

-ce variety of effects on a large numj^o Te'toes ThI or^a , °' '""^"='"5 ^ 

-'63:19098, 1988; Hohrriann et al.. 8/0/ Chem ^Sliar lasif Aim. 1 1 ^ ' ^"^'^^ 
these TNF.Rs nf r(iff»«nt m-,i«^ , «fe>4.]4g27. 1989). Although the relationship between 

^noLn ,H ; . 'molecular mass .s unclear. Hohmarn et al. {J. Biol. Chem 26414927 i9^^ 

relationship of the above soluble urinary binding proteins was further elucidated after orininTn.ln! 

t^SSca^on lT"r ^'''V °' -eXi^trleo^^^ 
the .dentiftcation and purification of a second distinct soluble urinary TNF binding protein having an N- 

'Tin TT ^"'^^'"'^ °' Vaf-Ala-Phe-Thr-Pro- U Bio,. Chew. 265:153,. %90). TheUu^^a^ 

ir^^unocnem 1' ' " '"^ P^^^^^^^ns were shown oT 

■mnuinochemically related to two apparently distinct cell surface proteins by the ability of antiserum aaainst 
the binding proteins to inhibit TNF binding to certain cells. e ao ly or anDseri.m agains. 

rNFTanTtTn '^.TI' '^"^ '""'^"'"^ '='°"'"9 'xP^^sion of a human 55 kDa 

In N?« L T ' ■■ '^^^ ^'^361. ,990). The TNF-fl of both groups has 

Z^^ TT "^"""^^ corresponds to the partial arr^ino acid sequence of ?he urina" 

binding protein disclosed by UK 2 2,8 ,0, A. Engelmann at al. (,989) and Englelmann et al. (1990) 

orarlroT,'" tTT ^^'^^""'"^'P °' "'^ '^"'"P'^ 'P^'"' of TNF.R and soluble urinary TNF binding 
proteins, or to study the structural and biological characteristics of TNF-Rs and the role played by TNF-Rs 

T^T^ PPP"'^«P"5 » ™P Of otf^er cytokine stimulation, or to use TNF-Rs 

hnw« « VI °' """"^ compositions of TNF-R are needed. Such compositions, 

however, are obtainable m practical yields only by cloning and expressing genes encoding the receptors 
using recombinant ONA technology. Efforst to purify the TNF-R molecule for use in biochemVal anaS ^! 
to Clone and express mammalian genes encoding TNF-R. however, have been impeded by lack of a 

T^t hTf . °' '° '"^ P^^^^"' "P =«" -e^e known to 

express high levels of TNF-R constitutively and continuously, which precluded purification of receptor for 
sequencing or construction of genetic libraries for cONA cloning. 
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SUH/IMARY OF THE INVENTION 

The present invention provides .solated TNF receptors and ONA sequences encoding mammalian 
tumor necrosis factor receptors (TNF-R). ,n particular, human TNF-Rs. Such ONA sequences mcluce (a) 
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cONA clones having a nucleotide sequence derived frcnn the ceding region of a native TNf^R gene- ,b) 
ONA sequences which are capable of hybridization to the cDNA clones of (a) uncer moderately stringent 
conditions and which encode biologically active .TNF-R nnolecules: or (c) ONA sequences wmcn are 
degenerate as a result of the genetic code to the ONA sequences defined in (a) anc (b) and wnich encoce 

5 b.oiogically active TNF-R molecules, in panicuiar. the present invention provides ONA sequences which 
encode soluble TNF receptors. 

The present invention also provides recombinant expression vectors compnsing the ONA sequences 
defined above, recombinant TNF-R molecules produced using the recombinant expression vectors, and 
processes for producing the recombinant TNF-R molecules using the expression vectors. 

•0 The present invention also provides isolated or purified protein compositions comprising TNF-R. and. m 
particular, soluble forms of TNF-R. 

The present invention also provides compositions for use in therapy, diagnosis, assay of TNF-R. or m 
raising antibodies to TNF-R. comprising effective quantities of soluble native or recombinant receptor 
proteins prepared according to the foregoing processes. 

!5 Because of the ability of TNF to specifically bind TNF receptors (TNF-Rs). purified TNF-R compositions 
will be useful in diagnostic assays for TNF, as well as in raising antibodies to TNF receptor for use in 
diagnosis and therapy. In addition, purified TNF receptor compositions may be used directly in therapy to 
bind or scavenge TNF. thereby providing a means for regulating the immune activities of this cytokine. 
These and other aspects of the present invention will become evident upon reference to the following 

20 detailed descnption. 



BRIEF DESCRIPTION OF THE DRAWINGS 

25 

Figure 1 is a schematic representation of the coding region of various cDNAs encoding human and 
murine TNF-Rs. The leader sequence is hatched and the transmembrane region is solid. 
Figure 2A-2B depict the partial cDNA sequence and derived amino acid sequence of the human TNF-R 
clone 1. Nucleotides are numbered from the beginning of the 5 untranslated region. Amino acids are 

30 numbered from the beginning of the signal peptide sequence. The putative signal peptide sequence is 
represented by the amino acids -22 to -i. The N-terminal leucine of the mature TNF-R protein is 
underlined at position i. The predicted transmembrane region from amino acids 236 to 265 is also 
underlined. The C-termint of various soluble TNF-Rs are marked with an arrow (t). 
Figure 3A-3C depict the cONA sequence and derived amino acid sequence of murine TNF-R clone n. 

as The putative signai peptide sequence is represented by amino acids -22 to -i. i. The N-terminal valine 
of the mature TNF-R protein is underlined at position 1. The predicted transmembrane region from 
amino acids 234 to 265 is also underlined. 

DETAILED DESCRIPTION OF THE INVENTION 

40 



Definitions 

tfS As used herein, the terms "TNF receptor" and "TNF-R" refer to proteins having amino acid sequences 
which are substantially similar to the native mammalian TNF receptor amino acid sequences, and which are 
biologically active, as defined below, in that they are capable of binding TNF molecules or transducing a 
biological signal initiated by a TNF molecule binding to a cell, or cross-reacting with anti-TNF-R antibodies 
raised against TNF-R from natural (i.e., nonrecombinant) sources. The mature full-length human TNF-R is a 

so glycoprotein having a molecular weight of about 80 kilodaltons (kOa). As used throughout the specification, 
the term "mature** means a protein expressed in a form lacking a leader sequence as may be present in 
full-length transcripts of a native gene. Experiments using COS cells transfected with a cDNA encoding full- 
length human TNF-R showed that TNF-R bound '^^l-TNFa with an apparent K. of about S x 10' M~\ and 
that TNF-R bound '25|.tnF;3 with an apparent K, of about 2 x 10^ The terms "TNF receptor'* or "TNF- 

55 R** include, but are not limited to. analogs or subunits of native proteins having at least 20 amino acids and 
which exhibit at least some biological activity in common with TNF-R. for example, soluble TNF-R 
constructs which are devoid of a transmembrane region (and are secreted from the cell) but retain the 
ability to bind TNF. Various bioequivalent protein and amino acid analogs are described in detail below. 
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The "omenclature for TNF-R analogs as bsec'h^rsn roiicwr re convention of .*amtng -he -rc*o 
TNF-R) precsaed by either hu (for human) or nriu (for murine) and followed by a A (lo resign ate'' a'^'^i 
and the number of the C-termmai ammo acid. For example. nuTNF-RA235 refers to human TNF-R^^^'°"* 
A3p^-5 as the C-terminai ammo acid (i.e.. a polypeptide having the sequence of amino acids 1-235 of Fi""^^ 

5 2A). In the absence of any human or murine species designation. TNF-R refers genertcaily to mammlr^ 
TNF-R. Similarly, in the absence of any specific designation for deletion m.utants. the term TNF.R means'ail 
forms of TNF-R. including mutants and analogs which possess TNF-R biological activity. ^ 
"Soluble TNF-R- or "sTNF-R" as used in the context of the present mvention refer to protems or 
sucstantiaiiy equivalent analogs, having an amino acid sequence corresponcing lo ail or oan of the 

ro extracellular region of a native TNF-R. for example. huTNF.RA235. huTNF^RA,i85 and huTNF-RA;63 or 
amino acid sequences substantially similar to the sequences of ammo acids t-i63. amino acids 1-135 or 
amino acids 1-235 of Figure 2A. and which are biologically active m that they bind to TNF ligard 
Equivalent soluble TNF-Rs include polypeptides which vary from, these sequences by one or mora 
substitutions, deletions, or additions, and which retain the abiiity to bind TNF or inhibit TNF signa^l 

75 transduction activity via ceil surface bound TNF receptor proteins, for example huTNF-Rdx. wherem t is 
selected from the group consisting of any one of amino acids 163-235 of Figure 2A. Analogous ceietions 
may be made to muTNF-R. Inhibition of TNF signal transduction activity can be determined by transfecring 
cells with recombinant TNF-R ONAs to obtain recombinant receotor expression. The cells are then 
contacted with TNF and the resulting metabolic effects examined. If an effect results which is attributable to 

20 the action of the ligand. then the recombinant receptor has signal transduction activity. Exemplary 
procedures for determining whether a polypeptide has signal transduction activity are disclosed by Idzeraa 
et al.. J, Exp. Med. ;7/:861 (1990): Curtis et ai.. Proc. Nat/. Acad. Sci. USA 36:3045 (1989): Prywes et ai.. 
cMBO J. 5:2179(1986) and Chou et al.. J. BioL Chem. 26^1342 (19S7). Alternatively, primary cells or cell 
lines which express an endogenous TNF receptor and have a aetectable biological response to TNF could 

25 also be utilized. 

The term "isolated" or "purified'*, as used in the context of this specification to define the purity of 
TNF-R protein or protein compositions, means that the protein or protein composition is substantially free of 
other proteins of natural or endogenous origin and contains less than about 1% by mass of protein 
contaminants residual of production processes. Such compositions, however, can contain other proteins 

30 added as stabilizers, carriers, excipients or co-therapeutics. TNF-R is isolated if it is detectable as a single 
protein band in a poiyacryfamide gel by silver staining. 

The term ** substantially similar," when used to define either amino acid or nucleic acid sequences, 
means that a particular subject sequence, for example, a mutant sequence, varies from a reference 
sequence by one or more substitutions, deletions, or additions, the net effect of which is to retain biological 

35 activity of the TNF-R protein as may be determined, for example, in one of the TNF-R binding assays set 
forth in Example 1 below. Alternatively, nucleic acid subunits and analogs are "substantially similar" to the 
specific DNA sequences disclosed herein if: (a) the DNA sequence is derived from the coding region of a 
native mammalian TNF-R gene: (b) the DNA sequence is capable of hybridization to DNA sequences of (a) 
under moderately stringent conditions (50 'C. 2x SSC) and which encode biologically active TNF-R 

40 molecules: or DNA sequences which are degenerate as a result of the genetic code to the DNA sequences 
defined in (a) or (b) and which encode biologically active TNF-R molecules. 

"Recombinant,** as used herein, means that a protein is derived from, recombinant (e.g.. microbial or 
mammalian) expression systems. "Microbial" refers to recombinant proteins made in bacterial or fungal 
(e.g.. yeast) expression systems. As a product, "recombinant microbial" defines a protein produced in a 

45 microbial expression system which is essentially free of native endogenous substances. Protein expressed 
in most bacterial cultures, e.g., £ co//. will be free of glycan. Protem expressed in yeast may have a 
glycosyfation pattern different from that expressed in mammalian ceils. 

"Biologically active." as used throughout the specification as a characteristic of TNF receptors, means 
that a particular molecule shares sufficient amino acid sequence similarity with the embodiments of the 

so present invention disclosed herein to be capable of binding detectable quantities of TNF. transmitting a TNF 
stimulus to a cell, for example, as a component of a hybrid receptor construct, or cross-reacting with anti- 
TNF-R antibodies raised against TNF-R from natural (i.e., nonrecombinant) sources. Preferably, biologically 
active TNF receptors within the scope of the present invention are capable of binding greater than O.i 
nmoles TNF per nmole receptor, and most preferably, greater than 0.6 nmole TNF per nmole receptor m 

55 standard binding assays (see below), 

"Isolated DNA sequence" refers to a DNA polymer, in the ?orm of a separate fragment or as 2 
component of a larger DNA construct, which has been derived from DNA isolated at least once in 
substantially pure form. i.e.. free of contaminating endogenous materiais and in a quantity or concentration 
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snabling ,aent,ficat,on. manipulation, and recovery of the sequence and its component nuciect.c 

quences Dy standard biochemical r^ethods. for example, usmg a cloning vector. Such sec en 4 re 

prereracly provided in the form of an open reading frame uninterrupted by interna nontran la e'd I 

quences. or mtrons, which are typically present in eu.aryo.ic genes. Genomic ONA containing theX. nt 

oresent? or V T 'V''' " ' °' ""'"^ ^^^'^^""^^ ^"^^'^''^ °' non-translated'oNA may be 

present 5 or 3 from the open reading frame, where the same do not interfere with manipulation or 
expression of the coding regions. 'puiduon or 

"Nucleotide sequence" refers to a heteropoiymer of deoxyribonucleotides. DNA sequences enccdino 
the proteins provided by this invention can be assembled from cDNA fragments and short oligonucleotice 
linkers, or from a series of oligonucleotides, to provide a synthetic gene which is capable of teZ 

expressed in a recombinant transcriptional unit. 
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Isolation of cDNAs Encoding TNF-R 
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The coding sequence of TNF-fl ,s obtained by isolating a complementary ONA (cONA) sequence 
encoding TNF-R from a recombinant cONA or genomic ONA library. A cONA library ,s preferably 
constructed by obtaining poiyadenyiated mRNA from a particular cell line which expresses a mammalian 
TNF-R. for example, the human fibroblast cell line WI-26 VA4 (ATCC CCL 95.1) and using the mRNA as a 
template for synthesizing aouble stranded cONA. The aouble stranded cONA is then packaged into a 
recombinant vector, which is introduced into an appropriate £ colt strain and propagated. Murine or other 
mammalian cell lines which express TNF-R may also be used. TNF-R sequences contained in the cDNA 
library can be readily identified by screening the library with an appropriate nucleic acid probe which is 
capable of hybridizing with TNF-R cONA. Alternatively. ONAs encoding TNF-R proteins can be assembled 
by ligation of synthetic oligonucleotide subunits corresponding to ail or part of the sequence of Figures 2A- 
2B or 3A-3C to provide a complete coding sequence. 

The human TNF receptor cDNAs of the present invention were isolated by the method of direct 
expression cloning. A cDNA library was constructed by first isolating cytoplasmic mRNA from the human 
fibroblast cell line WI-26 VA4. Poiyadenyiated RNA was isolated and used to prepare double-stranded 
30 cONA. Purified cONA fragments were then ligated into pCAV/NOT vector DNA which uses regulatory 
sequences derived from pDC20l (a derivative of pMLSV, previously described by Cosman et al.. Nature 
3/2:768. 1984). SV40 and cytomegalovirus ONA. described in detail below in Example 2. pCAV NOT has 
been deposited with the American Type Culture Collection under accession No. ATCC 68014. The 
pCAWNOT vectors containing the WI26-VA4 cONA fragments were transformed into £ co/i strain DHSa. 
35 Transformants were plated to provide approximately 800 colonies per plate. The resulting colonies were 
harvested and each pool used to prepare plasmid ONA for transfection into COS-7 cells essentially as 
described by Cosman et ai. {Nature 3/2:768. 1984) and Luthman et al. {NucL Acid Res. /;:1295. 1983). 
Transformants expressing biologically active cell surface TNF receptors were identified by screening for 
their ability to bind '^si.jNF. In this screening approach, transtected COS-7 cells were incubated with 
40 medium containing 'zSf-TNF. the cells washed to remove unbound labeled TNF. and the cell monolayers 
contacted with X-ray film to detect concentrations of TNF binding, as disclosed by Sims et al. Science 
24V.SQ5 (1988). Transfectants detected in this manner appear as dark foci against a relatively light 
background. 

Using this approach, approximately 240.000 cONAs were screened in pools of approximately 800 
45 cONAs until assay of one transfectant pool indicated positive foci for TNF binding. A frozen stock of 
bacteria from this positive pool was grown in culture and plated to provide individual colonies, which were 
screened until a single clone (clone 1 1) was identified which was capable of directing synthesis of a surface 
protein with detectable TNF binding activity. The sequence of cONA clone 1 1 isolated by the above method 
is depicted in Figures 3A-3C. 
50 Additional cONA clones can be isolated from cDNA libraries of other mammalian species by cross- 
species hybridization. For use in hybridization, ONA encoding TNF-R may be covalently labeled with a 
detectable substance such as a fluorescent group, a radioactive atom or a chemiluminescent group by 
methods well known to those skilled in the art. Such probes could also be used for in vitro diagnosis of 
particular conditions. 

55 Like most mammalian genes, mammalian TNF receptors are presumably encoded by multi-exon genes. 
Alternative mRNA constructs which can be attributed to different mRNA splicing events following transcrip- 
tion, and which share large regions of identity or similarity with the cDNAs claimed herein, are considered 
to be within the scope of the present invention. 
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Other mammalian TNF-R cONAs are 
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5 Proteins and Analogs 

«... ™„o„,. v;,; s"sis"r, lo^Tzr ,s;;'r™vroZ^^^^^^ 

:nveot,on are optionally without associated native-pattern glycosylation Poiypepnoes of .h,s 

Mammalian TNF-R of the present invention includes, by way of examoi*. r,rina,o h, 

^5 speaes ybr.d.zat.on. us.ng a s.ngie stranded cONA derived from the human TNF-R ONA seaLiVs a 
hyona.zation prooe to isolate TNP.fl cONAs from mammalian cONA libraries 

onm^rv''.'!"' ^^'^ /'^iJ^: mvention also include various structural forms of :he 

for example, a . NF-R prote.n may be ,n the form of acidic or basic salts, or may be in neu ral form 
20 individual amino acid residues may also be modified by oxidation or reduction 

.ne primary amino acid structure may be modified by forming covalent or aggregative comugates w.th 
other chem.cai mo.et.es. such as glycosyl groups, lipids, phosphate, acetyl groucs and the like, or b 
nT^^^^T^lT '^"^"""^ derivatives are prepared by linking partcular functional 

groups CO TNF-R am.no acid side chains or at the N-cr C-termini, Other derivatives of TNF-R within the 
scope of rh.s -nvention include covalent or aggregative conjugates of TNF-R or its fragments with other 
protems or polypeptides, such as by synthesis .n recombinant culture as N-terminal or C-terminal fusions 
ror example, the conjugated peptide may be a a signal (or leader) polypeptide sequence at the N-terminal 
region or the protein which co-translationally or post-translationally directs transfer of the protein from its 
s.te of synthesis to its site of function inside or outside of the cell membrane or wall (e.g.. the yeast a-facror 
^ader). TNF-R prote.n fusions can comprise peptides added to facilitate purification or identification of TNF. 
R {e.g.. polyHis). The ammo acid sequence of TNF receptor can also be linked to the peptide Aso-Tyr-Lys- 
Asp- Asp-Asp-Asp-Lys (DYKDOOOK) (Hopp et aL, Bio/Technology 5:1204.1988.) The latter sequence is 
Mighiy antigenic and provides an epitope reversibly bound by a specific monoclonal antibody enabling 
rapid assay and facile purification of expressed recombinant protein. This sequence is also specifically 
cleaved by bovine mucosal enterokinase at the residue immediately following the Asp-Lys pairing. Fusion 
prote-ns capped with this peptide may also be resistant to intracellular degradation in £ coli. 

TNF-R derivatives may also be used as immunogens. reagents in receptor-based immunoassays, or as 
binding agents for affinity purification procedures of TNF or other binding ligands. TNF-R derivatives may 
also be Obtained by cross-linking agents, such as M-maleimidobenzo/l succmimide ester and N-hydrox- 
ysuccin.miae. at cysteine and lysine residues. TNF-R proteins may also be covalently bound through 
reacove side groups to various insoluble substrates, such as cyanogen bromide-activated, bisoxirane- 
activated, carbonyldiimidazole-activated or tosyl-activated agarose structures, or by adsorbing to polyoiefin 
surraces (with or without glutaraldehyde cross-linking). Once bound to a substrate. TNF-R may be used to 
selectively bind (for purposes of assay or purification) anti-TNF-R antibodies or TNF. 
45 The present invention also includes TNF-R with or without associated native-pattern glycosylation. TNF- 
R expressed in yeast or mammalian expression systems, e.g.. COS-7 cells, may be similar or slightly 
different m molecular weight and glycosylation pattern than the native molecules, depending upon the 
expression ^ystem. Expression of TNF-R ONAs in bacteria such as £ coli provides non-glycosyiated 
molecules. Functional mutant analogs of mammalian TNF-R having inactivated N-glycosylation sites can be 
50 produced by oligonucleotide synthesis and ligation or by site-specific mutagenesis techniques. These 
analog proteins can ce produced in a homogeneous, reduced-carbohydrate form in good yield using yeast 
expression systems. N-glycosyiation sites in eukaryotic proteins are characterised by the amino acid triplet 
Asn-AfZ. where A, is any amino acid except Pro. and 2 is Ser or Thr. In this sequence, asoaragine 
provides a side chain amino group for covalent attachment of carbohydrate. Such a site can be eliminated 
55 by substituting another amino acid for Asn or for residue 2. deleting Asn or 2. or inserting a non-2 ammo 
acid between A, and 2. or an amino acid other than Asn between Asn and A, . 

TNF-R derivatives may also be obtained by mutations of TNF-R or its subunits. A TNF-R mutant, as 
referred tc herein, is a polypeptide homologous to TNF-R but which has an amino acid sequence different 
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from native TNF-R because of a deieticn. mserticn or substitution. ■ ' 

Bicequivalent analogs of TNF-R proteins may be construcreC by. for example, making vanots 
substitutions of residues or sequences or ceiettng terminal or interna! residues or sequences not needed for 
biological activity; For example, cysteine residues- can be deleted (e.g.. Cys='3) replaced with other 

5 amino acids to prevent formation of unnecessary or incorrect intramolecular disulfide bridges upon 
renaturation. Other approaches to mutagenesis involve modification of adjacent dibasic ammo acid residues 
to enhance expression in yeast systems in which K5X2 protease activity is present. Generally, substitutions 
should be made consen/atively; i.e.. the most preferred substitute amino acids are those hav.ng 
•physiochemical characteristics resembling those of the residue to be replaced. Similarly, when a deletion or 

:o insertion strategy is adopted, the potential effect of the deletion or insertion on biological activity should be 
considered. Substantially similar polypeptide sequences, as defined above, generally comprise a like 
number of amino acids sequences, although C-terminal truncations for the purpose of constructing soluble 
TNF-Rs will contain fewer ammo acid sequences. In order to preserve the biological activity of TNF-Rs. 
deletions and substitutions will preferably result in homologous or conservatively substituted sequences. 

ts meaning that a given residue is replaced by a biologically similar residue. Examples of conservative 
substitutions include substitution of one aliphatic residue for another, such as fie. Val. Leu. or Ala for one 
another, or substitutions of one polar residue for another, such as between Lys and Arg; Glu and Asp; or 
Gin and Asn. Other such conservative substitutions, for example, substitutions of entire regions having 
similar hydrophobicity characteristics, are well known. Moreover, particular amino acid differences between 

20 human, murine and other mammalian TNF-Rs is suggestive of additional conservative substitutions that 
may be made without altering the essential biological characteristics of TNF-R. 

Subunits of TNF-R may be constructed by deleting terminal or internal residues or sequences. 
Particularly preferred sequences include those in which the transmembrane region and intracellular domain 
of TNF-R are deleted or substituted with hydrophiiic residues to facilitate secretion of the receptor into the 

25 cell culture medium. The resulting protein is referred to as a soluble TNF-R molecule which retains its 
ability to bind TNF. A particularly preferred soluble TNF-R construct is TNF-RA235 (the sequence of amino 
acids 1-235 of Figure 2A). which comprises the entire extracellular region of TNF-R, terminating with Asp^^s 
immediately adjacent the transmembrane region. Additional amino acids may be deleted from the trans- 
membrane region while retaining TNF binding activity. For example. huTNF-RAl83 which comprises the 

30 sequence of amino acids 1-183 of Rgure 2A. and TNF-R a 163 which comprises the sequence of amino 
acids 1-163 of Figure 2A. retain the ability to bind TNF ligand as determined using the binding assays 
described below in Example 1. TNF-RA142. however, does not retain the ability to bind TNF ligand. This 
suggests that one or both of Cys'^' and Cys'" is required for formation of an intramolecular disulfide 
bridge for the proper folding of TNF-R. Cys'^^. which was deleted without any apparent adverse effect on 

35 the ability of the soluble TNF-R to bind TNF. does not appear to be essential for proper folding of TNF-R, 
Thus, any deletion C-terminal to Cys'^^ would be expected to result in a biologically active soluble TNF-R. 
The present invention contemplates such soluble TNF-R constructs corresponding to ail or part of the 
extracellular region of TNF-R terminating with any amino acid after Cys'^V Other C-terminal deletions, such 
as TNF-FA157. may be made as a matter of convenience by cutting TNF-R cDNA with appropriate 

40 restriction enzymes and. if necessary, reconstructing specific sequences with synthetic oligonucleotide 
linkers. The resulting soluble TNF-R constructs are then inserted and expressed in appropriate expression 
vectors and assayed for the ability to bind TNF, as described in Example i . Biologically active soluble TNF- 
Rs resulting from such constructions are also contemplated to be within the scope of the present invention. 
Mutations in nucleotide sequences constructed for expression of analog TNF-R must, of course. 

45 preserve the reading frame phase of the coding sequences and preferably will not create complementary 
regions that could hybridize to produce secondary mRNA structures such as loops or hairpins which would 
adversely affect translation of the receptor mRNA. Although a mutation site may be predetermined, it is not 
necessary that the nature of the mutation oer $e be predetermined. For example, in order to select for 
optimum characteristics of mutants at a given site, random mutagenesis may be conducted at the target 

50 codon and the expressed TNF-R mutants screened for the desired activity. 

Not all mutations in the nucleotide sequence which encodes TNF-R will be expressed in the final 
product, for example, nucleotide substitutions m.ay be made to enhance expression, primarily to avoid 
secondary structure loops in the transcribed mRNA (see EPA 75.444A. incorporated herein by reference), 
or to provide codons that are more readily translated by the selected host, e.g., the well-known c. coii 

55 preference codons for £. colt expression. 

Mutations can be introduced at particular loci by synthesizing oligo/^ucleotides containing a mutant 
sequence, flanked by restriction sites enabling ligation to fragments of the native sequence. Following 
ligation, the resulting reconstructed sequence encodes an analog having the desired amino acid msertion. 
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sucsitiuticn; or deienon. 

Alternatively, oligonucleoticie-direcrec ^it^-soectfic nurprpopcc n.^^»^ 
an altere. gene having panic. co.o.'S a^c J^'o 

!ri!;- - ' " "^^"^ Craik rs/o fec/io/ai/es. January ,985 w h 

(Genef,c cngmeeting: Pnnciptes ana Methoas. Plenum Press i98i)- and u s pVr!!; L c T 
4,737,462 disclose suitable ,echr,ipues. and are incorporated by Mferlncetrefn ' °" 

ootn monovalent forms and oolvvalent fnrm« nr tmc d • 
:his invention. Polyvalent 'orn./,:^^: ^Z ' Z:;^':-^ 
cvalent soluble TNF-R a,ay consist of two tandem repeats of amin'o ds 235 of Sure a^UoT'''' ' 
a linker region. Alternate polyvalent forms may also be constructed torLTr.il k "^'^ 
TNF-R ,0 any clinically acceptable carrier molecule, a polyme se e^^^^^^^^ 
PSlye^hyje^ or de.tran using convent.ona, Tplinrte^X 7^^^^^^ 
Chemically coupled to biotin. the biotin-TNF-R coniogate then allowed to bind to avioin resZo 

tnn.trophenol (TNP) and the result.ng conjugate precipitated with anti-ONP or ant,-TNP.|QM to form 
decameric conjugates with a valency of lO for TNF-R binding sites ^ " i ink igM, .o form 

A recombinant chimeric antibody molecule may also be produced having TNF-R sequences substituted 
for .^e vanable domains of either or both of the immunogiubulin molecui heavy and light chin and 
naving unmodined constant region domains. For exam.ple. chimenc TNF-R/IgG, may be produced ^om ^^o 
Chimeric genes ~ a TNF-ahuman , light chain chimera (TNF-ac,, and a TNF-^hum^ t hea y c^^^ 

LZhJ r '-'' i °''r'"' '"^ °' senes. the gene produce 

forms Jf Sp ^'^'^y ^--"9 ^NF-R displayed b-valently. Such polyvatem 

Sn Jr . ; " """"^ """"^ '^""y <^eta.ls relating to the 

construction of such chimenc antibody molecules are disclosed in WO 89/09622 and EP 315062. 



Expression of Recombinant TNF-R 

TMP^^^'"®"! '""®"**"" recombinant expression vectors to amplify or express DNA encoding 

I . R«;0"'b'na"t expression vectors are replicable ONA constmcts which have synthetic or cDNA- 
denved ONA fragments encoding mammalian TNF-R or bioequivalent analogs operably linked to suitable 
transcnptional or translational regulatory elements derived from mammalian, microbial, viral or insect genes 
A transcnptional unit generally comprises an assembly of (i) a genetic element or elements having a 
regulatory role m gene expression, for example, transcriptional promoters or enhancers. (2) a stnjctural or 
coding sequence which is transcribed into mRNA and translated into protein, and (3) appropriate transcrip- 
tion and translaton initiation and termination sequences, as described in detail below. Such regulatory 
elements may include an operator sequence to control transcription, a sequence encoding suitable mRNA 
nbosomal binding sites. The ability to replicate in a host, usually conferred by an origin of replication and a 
selection gene to facilitate recognition of transformants may additionally be incorporated. DNA regions are 
operably linked when they are functionally related to each other. For example. ONA for a signal peotide 
(secretory leader) is operably linked to ONA for a polypeptide if it is expressed as a precursor which 
participates m me secretion of the polypeptide: a promoter is ooerabiy linked to a coding sequence if it 
controls the transcription of the sequence: or a ribosome binding site is operably linked to a coding 
sequence if it is positioned so as to permit translation. Generally, operably linked means contiguous and in 
the case of secretory leaders, contiguous and in reading frame. Structural elements intended for use in 
yeast expression systems preferably include a leader sequence enabling extracellular secretion of trans- 
lated protein by a host cell. Alternatively, where recombinant protein is expressed without a leader or 
transport sequence, it may include an N-terminal methionine residue. This residue may ootionally be 
subsequently cleaved from the expressed recombinant protein to provide a final product. 

DNA sequences encoding mammalian TNF receptors which are to be expressed in a microorganism 
will preferably contain no introns that could prematurely terminate transcription of ONA into mRNA: 
however, premature termination of transcription may be desirable, for example, where it would result in 
mutants having advantageous C-terminal truncations, for example, deletion of a transmembrane region to 
yield a soluble receptor not bound to the cell membrane. Due to code degeneracy, there can be 
considerable variation in nucleotide sequences encoding the same amino acid sequence. Other embodi- 
ments include sequences capable of hybridizing to the sequences of the provided cONA under moderately 
stringent conditions (SO C. 2x SSC) and other sequences hybridizing or degenerate to those which encode 
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fciclogicaily active TNF receptor polypeotides. ■ 

Recorr.ciriart TNF-R ONA is expressed or amp.ifiea in a recombinant exoression system ccmcr.sinc ^ 
substantially homogeneous monoculture of suitable host microorganisms, for examole bac^ena 3ucn ^' 
coh or yeast such as S. cerevisiae. which have'statly integrated (by transformation or transfec:ic"n» 
5 rscomb-nant transcnptional unit into chromosomal ONA or carry the recombinant transcnotional. ur.t as a 
component of a resident plasmid. Generally, cells consntuting the system are the progeny of a sincte 
ancestral transformant. Recombinant expression systems as defined herein will express heteroloccus 
protein upon induction of the regulatory elements linked to the ONA sequence or synthetic cene -e 
• expressed. ^ 

ro Transformed host cells are cells which have been transformed or transfected with TNF-R vectors 
constructed using recombinant ONA techniques. Transformed host cells ordinarily express TNF-R but host 
cells transformed for purposes of cloning or amplifying TNF-R ONA co not need to express TNF-R 
Expressed TNF-R will be deposited in the cell membrane or secreted into the culture supernatant* 
depending on the TNF-R ONA selected. Suitable host cells for expression of mammalian TNF-R induce 
fS prokaryotes. yeast or higher eukaryotic cells under the control of appropriate promoters. Prokaryotes 
include gram negative or gram positive organisms, for example E. coli or bacilli. Higher eukaryotic cells 
include established cell lines of mammalian origin as described below. Cell-free translation systems ccuic 
also be employed to produce mammalian TNF-R using RNAs derived from the ONA constructs of the 
present invention. Appropriate cloning and expression vectors for use with bacterial, fungal, yeast, and 
20 mammalian cellular hosts are described by Pouweis et al. {Cloning Vectors: A Laboratory Manual. 
Elsevier. New York. 1985), the relevant disclosure of which is hereby incorporated by reference. 

Prokaryotic expression hosts may be used for expression of TNF-R that do not require extensive 
proteolytic and disulfide processing. Prokaryotic expression vectors generally comprise one or more 
phenotypic selectable markers, for example a gene encoding proteins conferring antibiotic resistance or 
25 supplying an autotrophic requirement, and an origin of replication recognized by the host to ensure 
amplification within the host. Suitable prokaryotic hosts for transformation include E. co/i, BaciUus subtHis, 
Salmonella typhimuhum, and various species within the genera Pseuaomonas, Streptomyces, and 
Staphyoiococcus. although others may also be employed as a matter of choice. 

Useful expression vectors for bacterial use can comprise a selectable marker and bacterial origin of 
30 replication derived from commercially available plasmids comprising genetic elements of the well known 
cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example. pKK223-3 
(Pharmacia Rne Chemicals. Uppsala. Sweden) and pGEMi (Promega Biotec. Madison. Wl. USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the structural sequence to be 
expressed. E. coli is typically transformed using derivatives of pBR322. a plasmid derived from an E. coli 
3S species (Bolivar et al.. Gene ^95. 1977). pBR322 contains genes for ampiciilin and tetracycline resistance 
and thus provides simple means for identifying transformed cells. 

Promoters commonly used in recombinant microbial expression vectors include the ^-lactamase 
(penicillinase) and lactose promoter system (Chang et al.. Nature 275:615, 1978; and Goeddei et al.. 
Nature 281.SM. 1979). the tryptophan (trp) promoter system (Goeddei et al.. NucL Acids Res. 8:4057. 
40 1980; and EPA 36.776) and tac promoter (Maniaiis. Molecular Cloning: a Laboratory Manual, Cold Spring 
Harbor Laboratory, p. 412, 1982). A particularly useful bacterial expression system employs the phage x P[, 
promoter and cl857ts thermolabile repressor. Plasmid vectors available from the American Type Culture 
Collection which incorporate derivatives of the x P,. promoter include plasmid pHUB2. resident in E. coli 
strain JMBQ (ATCC 37092) and pPLc28. resident in E. coli RRl (ATCC 53032). 
45 Recombinant TNF-R proteins may also be expressed in yeast hosts, preferably from the Sac- 
charomyces species, such as S. cerevisiae. Yeast of other genera, such as Pichia or Kluyveromyces 
may also be employed. Yeast vectors will generally contain an origin of replication from the 2u yeast 
plasmid or an autonomously replicating sequence (ARS), promoter. ONA encoding TNF-R. sequences for 
polyadenylation and transcription termination and a selection gene. Preferably, yeast vectors will include an 
50 origin of replication and selectable marker permitting transformation of both yeast and E. coli. e.g.. the 
ampiciilin resistance gene of £ coli and S. cerevisiae TRPi or URA3 gene, which provides a selection 
marker for a mutant strain of yeast lacking the ability to grow in tryptophan, and a promoter derived from a 
highly expressed yeast gene to induce transcription of a structural sequence downstream. The presence of 
the TRPi or URA3 lesion in the yeast host cell genome then proviaes an effective environment for 
55 detecting transformation by growth in the absence of tryptophan or uracil. 

Suitable promoter sequences in yeast vectors include the promoters for metallothionein, 3- 
phosphoglycerate kinase (Hitzeman et al.. J. Biol. Chem. 255:2073. 1930) or other glycolytic enzymes 
(Hess et al., J. A<3v. Enzyme Peg. 7:i49. 1968: and Holland et ai., Sicchem. ; 74900. i978). such as 
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25 



Russell et al. (J. ao/. Crte^ 258-2674 or,: ri" '*^^"^^- '''^ promoter has oeen cescnbec by 
leader. .... oirec:s secre.of '''ei^^^^^^^^^^^ ' r.e yeast ..ac-o^ 

'0 structural gene to be expressed. See. e.g Kurjan ^ a ' cX/ 3V933 ^"^^ 
^ca</. Sci. USA 3;:S330. 1984 The leader seauLrl . f ' ^"'^ ^' • ''^^^ 

«ir:,r~c sr;rr^;%"s°'4?2 r^--- ■= 

'5 selective medium consisting of 0 67% veast nX'r^!nJ IV ' '^^"'""^ '^^"sformants ,n a 

adenine and 20 u^-ml uracJor URa : t XCts ,n mZ' ^'^^^^^ '° ^^ -l 

and bases as described by Sherman et ^ rarora^rc Z.'"^'"'""?,"' ° ""^ 

Cold Spnng Harbor LaboraLy. Cold Spr.ng Hat^ S Jorf, 986^^^^^ " ''''' 

. .c. "n^Sir rSit^l % Ve^^^^^^^^ ,n a 

ug.ml adenine and 80 ag/ml urac^r OereprSsit 0?!.^!^^^^ ' °' *" 9'""'® supplemented w.th 30 
glucose, crude yeas, sup'ernaiS a^eTa^s r b t aJon n^nTa; ^ "^^^ TT'"'"^' 

for examo^ L ce«s Jl27 3T3 Chinll h T '^'"^'^ °' ''''^^'''"S an appropriate vector mcluding, 

verte'TrttrcermTbrol^^^ """^""'^^^ " '° ''^ - '^^^'o-^S 

aporLmat-iv °' "'^^^ ^^'^ '^^S-^^"" '^^^ ^'^o be used, provided the 

approximately 2S0 bp sequence extending from the Hin<3 3 site toward the Ban site located in thrv rai 

ZnceJm'rvt" ^^-^-^ P-^ter'^olT^ o '^i 

Sonal dTrt^, L ' '^"^""^ ^« -'t*^ host cell chosen 

mamrSSn ri^^ '"^""""^ °' ' '"'"""^"^ expression vector to produce a recomo^nant 

mammal-an rNF receptor are provided in Examples 2 and 7 below. Exemplary vectors can be coLru c ed 
as disclosed by Okayama and Berg [Mol. Cell. Biol. 3:280. 1983). Y ecio s oe cons>ruc.ed 

A useful system for stable high level expression of mammalian receptor cDNAs in Ci27 murine 
2Sr798r""' ""^'"'^"^ " et al. (^ci « 

are staS^trfJr^T'! ? '^"'"'''"ant expression vectors comprising TNF-R cONAs - 

ZoZ^'JnZZ S . transforming a host cell with a vector comprising a ONA sequence wh.ch 
wh"h enc^d« T " ' ' ^'"5- ■''^ "^^y ^'^P <=°-°"s« ^ seouence 

Which cT^SjZ nlT°'"'"- cel. may be co-transformed with a second vector 

wh,ch comprises the ONA sequence which encodes the desired protein. The transformed or co-transformea 
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nosr cans are ^hen cultured .n increasing concantraforr; 'o/ the known 'druy; -herecy seiectmg fcr ar-c- 
res.siam ceils. Such drug-resistant cells survive in increased concentraricns of (he tox.c arug by over- 
production of the enzynne which is inhibited by the drug, frequently as a result of amotification of rhe gene 
ancod.ng the enzyme. Where drug resistance is caused by an .ncrease in the copy number of the vector 
ONA encoding the .nn.b.table enzyme, there is a concomitant co-amplification of the vector ONA ancoaing 
the desired protein (TNF-R) in the host cell's ONA. 

A preferred system for such co-amplification uses the gene for dihydrofolate reductase (DHFR) which 
can oe inhibited by the drug methotrexate (MTX). To achieve co-ampiification. a host cell which lacks an 
acuve gane encoding OHFR is either transformed with a vector which comprises DNA sequence encoding 
OnFR and a desired protein, or is co-transformed with a vector comprising a DNA sequence encoding 
DHFR and a vector comorising a ONA sequence encoding the desired protein. The transformed or co- 
transTormed host cells are cultured in media containing increasing levels of MTX. and those c-lls' lines 
wnich survive are selected. 

A particularly preferred co-ampiification system uses the gene for gtutamine synthetase (GS) which is 
responsible for the synthesis of glutamate and ammonia using the hydrolysis of ATP to ADP ana phosphate 
to dnve the reaction. GS is subject to inhibition by a variety of inhibitors, for example methionine 
sulDhoximine (MSX). Thus. TNF-R can be expressed in high concentrations by co-amplifying cells 
transformed with a vector comprising the ONA sequence for GS and a desired protein, or co-transformed 
with a vector comprising a ONA sequence encoding GS and a vector comprising a DNA sequence encoding 
the desired protein, culturing the host cells in media containing increasing levels of MSX and selecting for 
surviving ceils. The GS co-amplification system, appropriate recombinant expression vectors and cells lines, 
are described in the following PCT applications: WO 87.04462, WO 89/01036 WO 89/10404 and WO 
36/05307. 

Recombinant proteins are preferably expressed by co-amplification of OHFR or GS in a mammalian 
host cell, such as Chinese Hamster Ovary (CHO) ceils, or alternatively in a murine myeloma cell line, such 
as SP2'0-Agl4 or NSO or a rat myeloma cell line, such as Y823.0-Ag20, disclosed in PCT applications 
WO/89/10404 ana WO 86/05807. 

A preferred eukaryotic vector for expression of TNF-R ONA is disclosed below in Example 2. This 
vector, referred to as pCAV/NOT. was derived from the mammalian high expression vector pOC20l and 
contains regulatory sequences from SV40, adenovirus-2. and human cytomegalovinjs. 



Purification of Recombinant TNF-R 



Purified mammalian TNF receptors or analogs are prepared by culturing suitable hosf vector systems to 
express the recombinant translation products of the ONAs of the present invention, which are then purified 
from cuiture media or cell extracts. 

For example, supematants from systems which secrete recombinant protein into cuiture media can be 
first concentrated using a commercially available protein concentration filter, for example, an Amicon or 
Mitiipore Petlicon ultrafiltration unit. Following the concentration step, the concentrate can be applied to a 
suitable purification matrix. For example, a suitable affinity matrix can comprise a TNF or lectin or antibody 
molecule bound to a suitable support. Alternatively, an anion exchange resin can be employed, for example, 
a matrix or substrate having pendant diethyiamincethyl (DEAE) groups. The matrices can be acrylamide, 
agarose, dextran. cellulose or other types commonly employed in protein purification. Alternatively, a cation 
exchange step can be empioyed. Suitable cation exchangers include various insoluble matrices comprising 
sulfopropyt or carooxymethyl groups. Suifopropyl groups are preferred. 

Finally, one or more reversed-phase high performance liquid chromatography (RP-HPLC) steps employ- 
ing hydrophobic RP-HPLC media, e.g.. silica gel having pendant methyl or other aliphatic groups, can be 
empioyed to further purify a TNF-R composition. Some or all of the foregoing purification steps, in various 
combinations, can also be employed to provide a homogeneous recombinant protein. 

Recombinant protein produced in bacterial culture is usually isolated by initial extraction from cell 
pellets, followed by one or more concentration, salting-out, aqueous ion exchange or size exclusion 
chrcm.atography steps. Finally, high performance liquid chromatography (HPUC) can be employed for final 
purification steps. Microbial cells employed in expression of recombinant mammalian TNF-R can be 
disrupted by any convenient method, including freeze-thaw cycling, sontcation. mechanical disruption, or 
use of cell lysing agents. 

Fermentation of yeast which express mammalian TNF-R as a secreted protein greatly simplifies 
purification. Secreted recombinant protein resulting from a iarge-scale fermentation can be purified by 



1 1 



£P 0 418 014 A1 



^5 



2Q 



25 



20. 



35 



40 



45 



50 



55 



Human TNF-R synthesized in recombinant culture is characterized by -he presence of non h ^ 
components, .ncluding proteins, in amounts and of a character wh.ch deoend^.^n the Zcaln'" 
a.en to recover human TNF-R from the culture. These components ordinarily Tbe o yea^^ P^^^^^^^^^^ 

orderTf"., ^" contarSinan u Sies 

?NF fl le r" °K ^ cell culture enables the proo Son o 

TNF-R free of protems wh,ch may be normally associated with TNF-R as it is found in nature in 1,7,0. 
or ongin, e.g. in cells, cell exudates or body fluids. ^^^'^^ 

Therapeutic Administration of Recombinant Soluble TNF-R 

The present invention provides methods of using therapeutic compositions compnsing an effec'ive 
renlnt ^ "'"'"^ -"^°<^^ for' suppressing "n" 

?NF R proter'"""''°'^ administering an effective amount of soluble 

^rJlT,^'^'"'''' "'""^ ^'^'"'"'ste^ed to a patient, preferably a human for 

treatment ,n a manner appropriate to the indication. Thus, for example, soluble TNF-R protein compositions 
can be adrri.n.stered by bolus injection, continuous infusion, sustained release from implants, or other 
suitable technique. Typically, a soluble TNF-R therapeutic agent will be administered in the form of a 
composition composing purified protein in conjunction with physiologically acceptable carriers, excipients or 

nTir'i T ""1 '° « concentrations employed. 

Ordinarily, the preparation of such compositions entails combining the TNF-R with buffers, antioxidarts such 
as ascorbic acid, low molecular weight (less than about 10 residues) polypeptides, proteins, amino acids, 
carbohydrates including glucose, sucrose or dextrins. chelating agents such as SDTA. glutathione and other 
stabilizers and excipients. Neutral buffered saline or saline mixed with conspecific serum albumin are 
exemplary appropriate diluents. Preferably, product is formulated as a lyophilizate using appropriate 
exopient solutions (e.g.. sucrose) as diluents. Appropriate dosages can be determined in trials. The amount 
and frequency of administration will depend, of course, on such factors as the nature and severity of the 
indication being treated, the desired response, the condition of the patient, and so forth 

Soluble TNF-H proteins are administered for the purpose of inhibiting TNF-dependent responses A 
variety of diseases or conditions are believed to be caused by TNF. such as cachexia and septic shock In 
addition, other key cytokines (IL-1. IL-2 and other colony stimulating factors) can also induce significant host 
production of TNF. Soluble TNF-R compositions may therefore be used, for example, to treat cachexia or 
septic shock or to treat side effects associated with cytokine therapy. Because of the primary roles IL-i and 
lL-2 play in the production of TNF. combination therapy using both IL-I receptors or IL-2 receptors may be 
preferred m the treatment of TNF-associated clinical indications. 

The following examples are offered by way of illustration, and not by way of limitation. 

EXAMPLES 



Example 1 



Binding Assays 

A. Radiolabeling at TNFa ana rNFg. Recombinant human TNFa. in the form of a fusion protein 
containing a hydrophilic octapeptide at the N-terminus. was expressed in yeast as a secreted protein and 
purified by affinity chromatography (Hoop et al,. aio/Technology 6:1204. 1988). Purified recombinant 
human TNFd was purchased from R&D Systems (l\/linneapolis. MN). Both proteins were radiolabeled 
using the commercially available solid phase agent. lOOO-GEN (Pierce), in this procedure. 5 ug of 
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lOOO-GSN were olated at the bcrtom of a in . 7^ f,r„ . .,. • • 

wiih 75 111 or n 1 M I °' ^ '° ' '""^ y'"S x'oe ano mcuoated for minutes at a> 

«.ih 75 al or 0.1 M sodium phcspnate. pH 7 A and 20 ul (2 mCl) Na ^s, -h,s solution 1, 
transferred to a second glass tube containing 5 .g TNF. (or TNFa) in 45 .1 PBS for 20 Ses a! 4 • r 
me reaction fixture was fractionated by gel filtration on a 2 ml bed volume ofsechaderG 25 ,S 
ecuiliorated ,n Roswell Park Memorial institute (RPMI) , 640 medium --om^i^Z f^o, , ^ 
serum albumin (BSA). 0.2% (w^v) sodium azide and 20 mM He^esTH 7 iTh 

Dcoi Of ■^^\-TNF «a<; rtii,,f»w , ^- "e„es pH /.4 (binding med.-um). The f nai 

routinely 1 x 10» com/mmols TNF ^ a<..ivii/. me specitic activuy is 



• routinely 1 x lO^ cpm/mmole TNF. 

0 8. Binding to Intact Cells. Binding assays with intact -eiis wora • 
method, ceils were first grown eith'er in suspen ion rQl^^gTz ''bv^h. °' 
(e.g.. WI26-VA4. COS cells expressing the ?com°b,n'a«-TN; e ep or AteTeTc'J're ruSsi^'^.r 
removed by treatment with 5mM EDTA treatment for ten mmu es at azTg eel c^grade S 
assays were then performed by a pthalate oil separation method (Ocwer et a. //mSo/ ,32? ? 

5 84) essen«ally as described by Par. e, al. U Bio,. C.em. 26,Am. ,986). Non-s^b, S f 

azide fO ," ! Tr?;:"' T'"" °' ' °' °' ""'^^^ed TNF. sTcJum 

! (0-2%) was included m a binding assay to inhibit internalization of ^^'..tnf by cells In the second 
method. COS cells transfected with the INF-fl-containing piasmid, and expressing TNF receptors on t e 

^JrTJT^.^'T'"^ '"'"'^ °' '° ''^ ^'^"'y ^<^5°^''«« '° nitrocellulose from 

detergent extracts of numan cells yet retain TNF-binding activity provided a means of detecting TNF-R 
Cell extracts were prepared by mixing a cell pellet with a 2 x volume of PBS containing i% Triton X-IOO 
and a cocktail of protease inhibitors (2 mM phenylmethyl sulfonyl fluoride. 10 aM pepstatin 10 uM 
leupeptin. 2 mM o-phenanthroline and 2 mM EGTA) by vigorous vortexing. The mixture was incubated 
on ice for 30 minutes after which it was centrifuged at I2.000x g for IS minutes at S" C to remove nuclei 
and other debns. Two microliter aliquots of cell extracts were placed on dry 8A85/-21 nitrocellulose 
memoranes (Schleicher and Schuell. Keene. NH) and allowed to dry. The membranes were incubated in 
hssue culture dishes for 30 minutes in Tris (0.05 M) buffered saline (0.15 M) pH 7.5 containing 3% wv 
BSA to blocl< nonspecific binding sites. The membrane was then covered with 5 x 10"" M '^^[-Tt^F in 
PBS * 3% BSA and incubated for 2 hr at 4-C with shai<ing. At the end of this time, the membranes 
were washed 3 times in PBS. dried and placed on Kodak X-Omat AR film for 18 hr at -70* C. 

Example 2 



2l """'3" "TNF-R cDNA by Direct Expression of Active Protein in COS-7 Cells 

Various human cell lines were screened for expression of TNF-fl based on their ability to bind "i- 
labeled TNF. The human fibroblast cell line WI-26 VA4 was found to express a reasonable number of 
receptors per cell. Equilibrium binding studies showed that the cell line exhibited biphasic binding of '"l- 
TNF with approximately 4.000 high affinity sites (K. = i x 10'" M-) and 15.00 low affinity sites (K. = i x 
10' M-') per cell. 

An unsized cDNA library was constructed by reverse transcription of poiyadenylated mRNA isolated 
from total RNA extracted from human fibroblast WI-26 VA4 ceils grcwn in the presence of pokeweed 
mitogen using standard techniques (Gubler. et al.. Gene 25:263. 1983: .Ausubel et al.. eds Current 
Protocols in Molecular Biology. Vol. i. 1987). The cells were harvested by lysing the cells in a guanidine 
nydrocnioride solution and total RNA Isolated as previously described (March et al., Nature 3/5:641. i985). 

Poly A RNA was isolated by ollgo dT cellulose chromatography and double-stranded cDNA was 
prepared by a method similar to that of Gubler and Hoffman (Gene 25:263. :383). Briefly, the poly a" RNA 
was converted to an flNA<ONA hybrid by reverse transcriptase using oligo dT as a primer. The RNA-cDNA 
hybrid was then converted into double-stranded cONA using RNAase H ,n combination with ONA poly- 
merase I. The resulting double stranded cONA was blunt-ended with T4 CNA polymerase. To the blunt- 
ended cONA is added Ecofll linker-adapters (having internal A/on sites) which were phosphorylated on only 
one end (Invitrogeri). The linker-adaptered cDNA was treated with T4 polynucleotide kinase to 
phosphorylate the 5 overhanging region of the linker-adapter and ur.lic^tad linkers were removed by 
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A Practical Apprcacr,. Glover, e... Z^?^ ^1^^^;;:'^'^' '""''^'^ Cicnll 
particles .s,ng a commercially available .,t to g^ne at= ' .ih.! 

Systems. San Oiego. CA. USA). Recornbinams .ere ^"^^^^^^ recomc,nar,ts (Straragene C:cn,'g 

of £. con strain c600(hf|-). ampl.r.ecJ by plating pnage on a bacterial ;awn 

Phage DNA was purified from the resultina xrrin -nM4 , u, 
digestion .ith the restriction enzyme A/on Ft.owinq 1 ec^oo^o '^^^ 

cDNAs greater than 2.000 bp were isolated -'^<='^°P'^o^es,s of the digest through an agarose gel. 
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The resulting cONAs were iiqated into the ^nkarv^f,^ « 
resigned to express cDNA sequences insered a, its muiS^^^^^ '^^ ' • ""^-^ 

cells. pCAV.-NOT was assembled from pOC20. 'a Sr^a ve o om7L 

al.. Nawre 312: 758. ,984). SV40 and cytomeg^S sTNA^d^^^^^^^^ °' ^^^"^ 

of transcription from the origin of replica ion- SV4; seauen^^, . ? J" 
origin Of replication, enhancer sequences and earardTe p orerT°^^^^^^ '"'^'"'^'"^ 
including the promoter and enhancer regions (nucleoSs 7, o ° Tfrom Z°'""^ "^"''" ''"^'""^ 
Boechan et al (Cell 4; S2i Iqfi';^• n» »-eotraes o/i cO +63 from the sequence published by 

intron ^etw^een thTf^s^ InTsei si h^:;-^^^^^^^^^^^ - 

Of the tripartite leader and a multiple cloning 'te oSn no si "orTor. T ? 

Bs/l: (4, SV40 sequences from coordinates 4127^,00 i.d 2^r2s^ 

termination signals for early transcription: ,5) sequences ^rivS frTm 0^42?'^^^''^ 
sequences. VAl and VAll of pOC20i. with adenovirus sequences iSasTi /A^ f v.rus-assocated 
genes, followed by pBR322 sequences from 4363-24l6'id ,oU^^^^^^^^^^^^ 

gene and origin of replication i"a'»-J/5 containing the ampicillin resistance 



SI. i/vuc/. /^c/as wes. 7 7.1295. 1983) and McCutchan et al, (J, Natl. Cancer Inst 4;-3Si iQPifii th^ 

Slo supernatants were discarded and the cell monolayers in each plate as«ye^ tor TNF 

binding as foHows. Three ml of binding medium containing ,.2 x 10- M '«,,abeled FUG* rNrJal 
added to each plate and the plates incubated at 4*0 for ,20 minutes. This medium was men discard^ 

for 72 hours at -70 C using an intensifying screen. TNF binding activity was visualized on -he exoosed 
f.lms as a dark focus against a relatively uniform background ^ 
t.n, nno'i '^^f "'"'^^•^'^ ^^'OOO recombinants the library had been screened in this manner, one transfec- 
exposure. '° ""'"^ ^^'^ ^'^^^'^ ^P''''^"' '''^ ^=^=^3'°"' ^ 

A frozen stock of bacteria from the positive pool was then used to obtain plates of approximately i50 
S^d ■ St ° "^'^ °" -troceilulose filters, and the plates were hen sera eJ and 

plasmid ONA prepared and transfected as described above to identify a positive plate Bacteria frorJ 
individual colonies from the nitrocellulose replica of this plate were grown in^cVm' c^2es w" h le." 

I'nJl^T 7 • "^"'''"'^ « this manner a 

single Clone, clone , . was isolated which was capable of inducing expression of human TNF-R in COS cells 
The expression vector pCAV/NOT containing the TNF-R cDNA clone , has been deposited with ;he 
American Type Culture Collection. ,230, Parklawn Drive. Rockville. MO 208S2. USA (Accession No. 68088) 
I under the name pCAV-NOT-TNF-R. on 6th Sept. ,989. 
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A cONA encoctng a soluble huTNF-RA235 {having the soquence-o*- .-arruno acids 'i*535 of .-icL:re 2A)' 
was constructed' by excising an 840 bp fragnnent from pCAV.NOT-TNF-R with the restriction enzyrnes Notr 
and Pvu2. Noti cuts at the rriultipie cloning site of pCAV/NOT-TNF-R and Pvu2 cuts within the TNF-R 
coding region 20 rrucleotides 5 of the transmembrane region, in order to reconstruct the 3' end of the TNF- 
s R sequences, two oHgonucieotides were synthesized and annealed to create the following oligonucieotice 
linker: 



SamHl Bgl2 
CTGAAGGGAGCACTGOCGACX^GGATCCA 

GACTTCCCTCGTGACCGCTGATTCCTAGGTCTAG 
AlaGluGlvSerThrGlyAap gnd 



This oligonucleotide linker has terminal Pvu2 and 8gl2 restriction sites, regenerates 20 nucleotides of the 
^- TNF-R, followed by a termination codon (underlined) and a BamHl restriction site (for convenience in 
isolating the entire soluble TNF-R by Not1/8amHi digestion). This oligonucleotide was then ligated with the 
340 bp Not! Pvu2 TNF-R insert into 8gl2'Not1 cut pCAV/NOT to yield psoihuTNF-RA23SCAVNOT. which 
was transfected into COS-7 cells as described above. This expression vector induced expression of soluble 
human TNF-R which was capable of binding TNF. 



Example 4 



25 

Construction of cDNAs Encoding Soluble huTNF-RAi85 

A cDNA encoding a soluble huTNF-RAl85 (having the sequence of amino acids l-i85 of Figure 2A) 
was constructed by excising a 340 bp fragment from pCAV/NOT-TNF-R with the restriction enzymes Noti 
and Bgl2. Noti cuts at the multiple cloning site of pCAV/NO-TNF-R and 8gt2 cuts within the TNF-R coding 
region at nucleotide 637, which is 237 nucleotides 5 of the transmembrane region. The following 
oligonucleotide linkers were synthesized: 

35 Bgi2 

5 ' -GATCTGTAACGTGGTGGCCATCCCTGGGAATGCAAGCATGGATGC-3 * 
ACATTGCACCACCGGTAGGGACCCTTACGTTCG 
lieCyaAanVaiValAlalleProGlyAanAlaSerMetAspAia 

40 

Noci 

5 ' - AGTCTGCACGTCCACGTCCCCCACCCGGI2^GC - 3 ' 

TACCTACGTCAGACGTGCAGGTGCAGGGGGTGGGCCACTCGCCGG 
VaiCysThrSerThrSerProThrArgEnd 

The above oligonucleotide linkers reconstruct the 3 end of the receptor molecule up to nucleotide 708. 
followed by a termination codon (underlined). These oligonucleotides were then ligated with the 640 bp 
Noti TNF-R insert into Not! cut pCAV/NOT to yield the expression vector psolTNFRAi85/CAVNOT. which 
was transfected into COS-7 cells as described above. This expression vector induced expression of soluble 
human TNF-R which was capable of binding TNF. 
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Construction of cONAs Encoding Soluble huTNF-RAl63 
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A cONA-enccc.ng a soluble huTNF.RAi63 (having the s'equVn'ce of am.no acids M53 of =\'r ' 
was constructed by excs.ng a 640 bp fragment from from pCAV/NOT-TNF.R w,th the restriction 
Noti and Bgl2 as described in Example 4. The fotlow.ng oligonucleotide linkers were syn:hesiz4- 



agX2 Noel 
5 • -GATCTGTI2AGC -3' 
ACAACTCGCCGG 
IleCysEnd 



Th.s aoove oligonucleotide linker reconstructs the 3 end of the receptor molecule up to nucleotide 642 
ammo aod 163) followed by a termination codon (underlined). This oligonucleotide was then legated with 
the 040 bp Noti TNF-R insert into Noti cut pCAV/NOT to yield the expression vector 
PSOITNFRA163/CAVNOT. which was transfected into COS-T cells as described above ^hj expression 
vector induced expression of soluble human TNF-R which was capable of binding TNF in the bindirc assav 
described in Example i . ^ y ^^*iy 



Example 6 



Construction of cDNAs Encoding Soluble huTNF-RAl 42 



A cDNA encoding a soluble huTNF-RAi42 (having the sequence of amino acids 1-142 of Rgure 2A) 
was constructed by excising a 550 bp fragment from from pCAV/NOT-TNF-R with the restriction enzymes 
Noti and AlwNI. AlwNl cuts within the TNF-R coding region at nucleotide 549. The following 
oligonucleotide linker was synthesized: 

Bgi2 Noti 
5 ' -CTGAAACATCAGACGTGGTGTGCAAGCCCTGTI2kAA-3 ' 
CTTGACTTTGTAGTCTGCACCACACGTTCGGGACAAT7TCTAGA 

End 

This above oligonucleotide linker reconstructs the 3* end of the receptor molecule up to nucleotide 579 
(amino acid 142). followed by a termination codon (underlined). This oligonucleotide was then ligated with 
the 550 bp Notl/AlwNl TNF-R insert into Not1/Bgl2 cut pCAV-NOT to yield the expression vector 
psolTNFRAl42'CAVNOT, which was transfected into COS-7 cells as described above. This expression 
vector did not induced expression of soluble human TNF-R which was capable of binding TNF. It is 
believed that this particular construct failed to express biologically active TNF-R because one or more 
essential cysteine residue (e.g.. Cys^^' or Cys'-^) required for intramolecular bonding {for formation of the 
proper tertiary structure of the TNF-R molecule) was eliminated. 



Example 7 



Expression of Soluble TNF Receptors in CHO Celis 



Soluble TNF receptor was expressed in Chinese Hamster Ovary (CHO) cells using the glutamine- 
synthetase (GS) gene amplification system, substantially as described in PCT patent application Nos. 
WO87-04462 and WO89/01036. Briefly. CHO cells are transfected with an expression vector containing 
genes for both TNF-R and GS. CHO cells are selected for GS gene expression based on the ability of the 
transfected ONA to confer resistance to low levels of methionine sulphoximine (MSX). GS sequence 
amplification events in such cells are selected using elevated MSX concentrations In* this way. contiguous 
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TNF-R secuencss are aiso amplified and enhanced TNF-R express'on =5 acnif^ved. 

The vector xised in the GS expression system was psoiTNFaP6/PSVLGS. wh.ch was ccnstrucec as 
follows. First, the vector pSVLGS.i {described in PCT Application Nos. WO87.04462 and WOSa-Oioae ^-c 
available from Ceiltech. Ltd.. Berkshire. UK) was -cut w.m the 3amHi restriction enzyme and 
5 phosphorylated with calf intestinal alkaline phosphatase (CIAP) to prevent the vector from religating to .tseif. 
The BamHl cut pSVLGS.i fragment was then ligated to a 2.4 kb SamHi to Bgl2 fragment of pEEShCMV 
{descnbed in PCT Application No. WO89/01036. also available from Ceiltech) which was cut with 3gl2. 
BamHl and Fspt to avoid two fragments of similar size, to yield an n.2 kb vector cesignaied 
p6. PSVLGS.I. pSVLGS.i contains the giutamine synthetase selectable marker gene under control of the 
;o SV40 later promoter. The SamHl to Bgi2 fragment of pEEShCfVlV contains the human cytomegalovirus 
major immediate early promoter (hCMV). a polylinker. and the SV40 early poiyadenylation signal. The 
coding sequences for soluble TNF-R were added to p6/PSVLGS.l by excising a Noti to BamHl fragment 
from the expression vector psolTNFaCAVNOT (made according to Example 3 aoove). blunt ending wrth 
Klenow and ligating with Smal cut dephosphorylated p6/PSVLGS.l . thereby placing the solTNF-R coding 
:5 sequences under the control of the hCMV promoter. This resulted in a single plasmid vector in which the 
SV40/GS and hCMB/solTNF-R transcription units are transcribed in opposite directions. This vector was 
designated psolTNFR/P6/PSVLGS. 

PS0ITNFRP6/PSVLGS was used to transfect CHO-Kl cells (available from ATCC. Rochville. MO. under 
accession number CCL 61) as follows. A monolayer of CH0-K1 cells were grown to subconfluency \n 
20 Minimum Essential Medium (MEM) lOX (Gibco: 330-1 581 AJ) without giutamine and supplemented with 
10% dialysed fetal bovine serum (Gibco: 220-6300AJ), i mM sodium pyruvate (Sigma). MEM non-essential 
amino acids (Gibco: 320-n40AG), 500 uM asparagine and glutamate (Sigma) and nucleosides (30 uM 
adenosine, guanosine. cytidine and uridine and 10 llM thymidine)(Sigma). 

Approximately 1x10^ cells per 10 cm petri dish were transfected with 10 ug of psoiTNFR/P6/PSVLGS 
25 by standard calcium phosphate precipitation, substantially as described by Graham i van der £b. Virology 
52:456 (1983). Cells were subjected to glycerol shock (15% glycerol in serum-free culture medium for 
approximately i.S minutes) approximately 4 hours after transfection, substantially as described by Frost & 
Williams. Wro/ogy 9 ;:39 (1973), and then washed with serum-free medium. One day later, transfected cells 
were fed with fresh selective medium containing MSX at a final concentration of 25 uM. Colonies of MSX- 
20 resistant surviving cells were visible within 3-4 weeks. Surviving colonies were transferred to 24-well plates 
and allowed to grow to confluency in selective medium. Conditioned medium from confluent wells were 
then assayed for soluble TNF-R activity using the binding assay described in Example i above. These 
assays indicated that the colonies expressed biologically active soluble TNF-R. 

In order to select for GS gene amplification, several MSX-resistant cell lines are transfected with 
as psolTNFR/Pe/PSVLGS and grown in various concentrations of MSX. For each cell line, approximately ixio^ 
cells are plated in gradually increasing concentrations of 100 uM. 250 uM. 500 uM and i mM MSX and 
incubated for 10-14 days. After 12 days, colonies resistant to the higher levels of MSX appear. The 
surviving colonies are assayed for TNF-fl activity using the binding assay descnbed above in Example i. 
Each of these highly resistant ceil lines contains cells which arise from multiple independent amplification 
40 events. From these cells lines, one or more of the most highly resistant cells lines are isolated. The 
amplified cells with high production rates are then cloned by limiting dilution cloning. Mass cell cultures of 
the transfectants secrete active soluble TNF-R. 
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Expression of Soluble Human TNF-R in Yeast 



Soluble human TNF-R was expressed in yeast with the expression vector plXY432. which was derived 
from the yeast expression vector plXYl20 and plasmid pYEP3S2. plXYi20 is identical to pYaHuGM (ATCC 
53157). except that it contains no cDNA insert and includes a polylinker multiple cloning site with a Ncoi 
55 restriction site. 

A ONA fragment encoding TNF receptor and suitable for cloning into the yeast expression vector 
piXYl20 was first generated by polymerase chain reaction (PCR) amplification of the extracellular portion of 
the full length receptor from pCAVNOT-TNF-R (ATCC 68088). The following primers were used in this PCR 
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ampiificairon; 



5' gnrl Pn'm^r 

.o^euAapLysArgAapTvrLyaAap 
3' End Primer r^inrj^^n^^^ 



rs =cu^x.^na< rNT-R- 



20 



25 



30 



35 



40 



50 



55 



The 5 end oligonucleotide primer used in the amn(ifi^^.h«« ■« . ^ 
fClowed Cy nucecides encoding the V e . of T St ^Sl'" 

and mose encoding the 8 amino acids of theJLvGT o.ntl , I T ^ (Pro-Leu-Asp-Lys-Arg, 
sequence encoding ,he 5 end of the mature 4eotof The F?Ari ' '''''*'"*''^'^^^'''-'^^' '° 
6:1204. 1988) is a highly antigenic sequence wh!^S rLl« ^! '""^ ®' S'o/rec/"'o'ogy 

9259). The o-lgonucfeoLe 'sed .rgeJ ate ^ ' e d' '. r^^^^^ 

strand of ONA encoding sequences which terminL -h! PCR-denved fragment is the antisense 

704 Of the mature coLg region (roS gTe^l^^^^^^^ 

introducing a TAA stop codon (undell^J Th« 1 T ^'^^^'"^ transmembrane domain) by 
The ONA sequences encocHng lNF fl^ ''V ^ 8amHi restriction site 

e.pri::o:'.X^S.r^^^^^^^^ - su^Led into the yeast 

enzymes, diges^ng P.XYiaoU'Srrnd ^p^TJ^^^ 'aCent" tot"" 

wfro with T4 DNA llaase The rP^iniHnn r-A««t« / xT, ^ fragment into the cut vector in 

To aass sacruon o( lh« soluwe Human TNF reooiOf Imm oll«45» . , „ . 

diluted inTYl^^r" appropriate seiect-ve media. The PBH9.6b/plXY434 transformants were 

.r;;rceiVy;eS^^^^^^^^^^^^ ~V 

Ml ant. FLAG® antibody, excess antibody was removed by washing and then dilutions of horseradish 
peroxidase con.ugated anti-mouse IgG antibodies were incubated w,m ,he filters Af,e remo!aro? excess 

approximately 10 mmutes prior to removal of the substrate solution 

The anti.FLAG<» reactive material found in the supernatants demonstrated that significant levels of 



Id 



EP 0 418 014 A1 

, . 

receptor were secreted by both expression systems. Compansorrs derrcnstrated :har the ;IXY432 syst:^-! 
secreted approximately 3-16 times more soluble human TNF receptor than the plXY424 system. The 
supernatants were assayed for soluble TNF-R activity, as described in Example i. by the.r ability to bind 
'2Si-TNFa and block TNfa binding. The piXY432 supernatants were found to contain significant levels of 
5 active soluble TNF-R, 



Example 9 

Isolation of Murine TNF-R cONAs 



15 Murine TNF-R cDNAs were isolated from a cDNA library made from murine 7B9 cells, an antigen- 
dependent helper T cell line derived from C578L6 mice, by cross-species hybridization with a human TNF- 
R probe. The cONA library was constructed in x2AP (Stratagene. San Oiego). substantially as described 
above in Example 2. by isolating polyadenylated RNA from the 7B9 cells. 

A double-stranded human TNF-R cDNA probe was produced by excising an aoproximately 3.5 kb Noti 

20 fragment of the human TNF-R clone i and ^^P'\abei\r\Q the cONA using random primers (Boehringer- 
Mannheim). 

The murine cONA library was amplified once and a total of 900.000 plaques were screened, substan- 
tially as described in Example 2, with the human TNF-R cONA probe. Approximately 21 positive plaques 
were purified, and the Stuescript plasmids containing EcoRi -tinkered inserts were excised (Stratagene. San 
25 Diego). Nucleic acid sequencing of a portion of murine TNF-R clone 1 1 indicated that the coding sequence 
of the murine TNF-R was approximately 88% homologous to the corresponding nucleotide sequence of 
human TNF-R. A partial nucleotide sequence of murine TNF-R cONA clone 1 1 is set forth in Figures 3A-3B. 

30 Example 10 



Preparation of Monoclonal Antibodies to TNF-R 

as 

Preparations of purified recombinant TNF-R. for example, human TNF-R. or transfected COS cells 
expressing high levels of TNF-R are employed to generate monoclonal antibodies against TNF-R using 
conventional techniques, for example, those disclosed in U.S. Patent 4.411.993. Such antibodies likely to be 
40 useful in interfering with TNF binding to TNF receptors, for example, in ameliorating toxic or other 
undesired effects of TNF, or as components of diagnostic or research assays for TNF or soluble TNF 
receptor. 

To immunize mice. TNF-R immunogen is emulsified in complete Freund's adjuvant and injected in 
amounts ranging from 10-100 ug subcutanecusly into Balb/c mice. Ten to twelve days later, the immunized 

45 animals are boosted with additional immunogen emulsified in incomplete Freund's adjuvant and periodically 
boosted thereafter on a weekly to biweekly immunization schedule. Serum samples are penodically taken 
by retro-orbifat bleeding or tail-tip excision for testing by dot-blot assay (antibody sandwich) or ELISA 
(enzyme-linked immunosorbent assay). Other assay procedures are also suitable. Following detection of an 
appropriate antibody titer, positive animals are given an intravenous injection of antigen in saline. Three to 

so four days later, the animals are sacrificed, splenocytes harvested, and fused to the murine myeloma cell 
line NSi. Hybridoma cell lines generated by this procedure are plated in multiple microtiter plates in a HAT 
selective medium (hypoxanthine. aminopterin. and thymidine) to inhibit proliferation of non-fused cells, 
myeloma hybrids, and spleen cell hybrids. 

Hybridoma clones thus generated can be screened by ELISA for reactivity with TNF-R. for example, by 

55 adaptations of the techniques disclosed by Engvall et al., Immunochem. S:87i (I97i) and in U.S. Patent 
4.703.004. Positive clones are then injected into the peritoneal cavities of syngeneic Salb/c mice to produce 
ascites containing high concentrations (>i mg/ml) of anti-TNF-R monoclonal antibody. The resulting 
monoclonal antibody can be purified by ammonium sulfate precipitation followed by gel exclusion 
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cnrcmatosraphy. anfi.or affinity chromatography based on oir.ding-of antibody to Protein A of 5.^"o>,', 
cus aureus. ^'i-uuy lo r-.otem a ot ^tapnyioccc- 



Claims 



? 'iTTt nT. '"""^'"^ ' biolog-cally active mammalian TNF receptor (TNF-R) protein 

Z ' ONA .n T""" '° '• '''''''' Sroup oons,sti g Of: ' 

^NF . 4ne ' ' ^^''"^"'^^ '^^'^^^ ''^"^ '"^ ""^"'"^ ^^^ion of a native mammalian 



gene; 

is^^raTSn'/nVTf °' '° 0' "Hder moderately stringent conditions 

(50 0, 2 X SSC) and wmch encode biologically active TNF.fl prctem- and ' ^ ■ 

(C) ONA sequences which are degenerate as a result of the genetic code to the ONA sequenc-s cefin^H 
(a) and (b) and Ahich encode biologically active TNF-R protein sequences defined 

" I' Tn 'IT^. '""'"'"^ '° ' "''"•^ ^ ^O"^'''^ '^"'^an TNF-R protein 

4. An isolated ONA sequence according to claim 3. wherein the soluble human TNF-R protein has an amino 
acid sequence comprises the sequence of amino acid residues , -x of Rgure 2A. where'n is selected "ir^ 
the group consisting of amino acids 163-235 seiectec .rom 

5. An isolated ONA sequence according to claim 3. wherein the soluble human TNF-R protein comprises 
20 the sequence of ammp acids 1-235 of Figure 2A. comprises 

6. A ONA sequence according to claim 5. wherein amino acid residue 46 is selected from the qrouo 

consisting.of lie and Thr and amino acid residue 1 18 is selected from the group consisting of Val and I 

7^ An isolated ONA sequence according to claim 3. wherein the soluble human TNF-R protein comprises 
the sequence of amino acids 1-185 of Rgure 2A. ^.ompnses 

25 8. An isolated ONA sequence according to claim 3. wherein the soluble human TNF-R protein comprises 
the sequence of amino acids i-i63 of Rgure 2A. k u y 

9. A recombinant expressipn vector comprising a ONA sequence according to any one of claims 1-8 
10 A process for preparing a biologically active mammalian TNF receptor (TNF-R) protein, comprisinq 
cultunng a suitable hpst cell comprising a vector according to claim 3 under conditions promoting 

JO expression, ' 

11 . A purified biologically active mamnnaiian TNF receptor (TNF-R) protein. 

12. A purified biologicaliy active soluble hunnan TNF-R protein. 

13. A purified biologically active TNF-R protein according to claim 12. comprising the sequence of amino 
ac(d residues l -235 of Rgure 2A, 

35 14. A purified biologically active TNF-R protein according to claim 12. comprising the sequence of amino 
acid residues 1-185 of Rgure 2A. 

15- A purified biologically active TNF-R protein according to claim 12. comprising the sequence of amino 
acid residues 1-163 of Rgure 2A. 

16. The use of a mammalian TNF-R protein in preparing a medicament for regulating immune responses in 
40 mammals. 

1 7. The method of claim 18, wherein the TNF-R protein is human TNF-R and the mammal to be treated is a 
human. 

18. The use of mammalian TNF-R protein in preparing a pharmaceutical composition suitable for parenteral 
administration to a human patient for regulating immune responses. 

45 19. A process for detecting TNF or TNF-R molecules or the interaction thereof, comprising use of a 
mammalian iNF receptor protein, a soluble TNF receptor protein capable of binding TNF or substantially 
similar TNF-R analog produced by recombinant cell culture. 
20. Antibodies immunoreactive with mammalian TNF receptors. 

so Cfaims for the following Contracting State; ES 

K A process for preparing a purified mammalian TNF receptor (TNF-R) protein, the process comprising 
coupling together successive amino acid residues by the formation of peptide oonds to form a TNF-fl 
polypeptide. 

55 2. A process according to claim l . wherein the TNF-R protein is a soluble human TNF-R protein, 

3. A process according to claim 2, wherein the soluble TNF-R protein has an ammo acid sequence 
comprising the sequence of amino acid residues i-x of Rgure 2A. wherein x is selected from the group 
consisting of amino acids 163-235. 
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^. A process according :o claim 3. wherein the soluble TNF-V^ prpteioJJpas in amto .acid' sequence /^nch' 
comprises :he sequence of amino acid residues 1-235 of Figure 2A. 

5. A process according to claim 3. wherein the soluble TNF-R protein has an' amio acid sequence which 
comprises the sequence of amino acid residues i -185 .of Figure 2A. 
5 5. A process according to claim 3, wherein the soluble TNF-R protein has an amio acid sequence which 
comprises the sequence of amino acid residues i-i63 of Figure 2A. 

7. The use of a mammalian TNF-R protein in prepanng a medicament for regulaiing immune responses in 
mammals. 

3. The use of a mammalian TNF-R protein in preparing a pharmaceutical composition suitable for parenteral 
?o administration to a human patient for regulating immune responses. 

9. A process for preparing a ONA sequence encoding a mammalian TNF receptor (TNF-R) protein, the 
process comprising coupling together successive nucleotide residues. 

10. A process for preparing a ONA sequence according to claim 9. wherein the ONA sequence encodes a 
soluble human TNF-R protein. 

rs 11. A process for preparing a ONA sequence according to claim 10. wherein the ONA sequence encodes a 
soluble TNF-R protein having an amino acid sequence comprising the sequence of amino acid residues i-x 
of Figure 2A. wherein x is selected from the group consisting of amino acids 163-235. 

12. A process for preparing a ONA sequence according to claim 10. wherein the ONA sequence encodes a 
soluble TNF-R protein having an amio acid sequence which comprises the sequence of amino acid 

20 residues 1-235 of Fgure 2A. 

13. A process for preparing a ONA sequence according to claim 10. wherein the ONA sequence encodes a 
soluble TNF-R protein having an amio acid sequence which comprises the sequence of amino acid 
residues 1-185 of Figure 2A. 

14. A process for preparing a ONA sequence according to claim 10. wherein the ONA sequence encodes a 
25 soluble TNF-R protein having an amio acid sequence which comprises the sequence of amino acid 

residues 1-163 of Figure 2A. 

1 5. A process for preparing a ONA sequence according to claim 9, said ONA being selected from the group 
consisting of: 

(a) cONA clones having a nucleotide sequence derived from the coding region of a native mammalian 
30 TNF-R gene; 

(b) ONA sequences capable of hybridization to the clones of (a) under moderately stringent conditions 
(50* C, 2 X SSC) and which encode biologically active TNF-R protein: and 

(c) ONA sequences which are degenerate as a result of the genetic code to the ONA sequences defined 
in (a) and (b) and which encode biologically active TNF-R protein. 

js 16. A process for preparing a ONA sequence according to claim 9. said ONA encoding a TNF-R protein 
having the sequence of amino acids of the TNF-R protein expressed by pCAV-NOT-TNF-R (ATCC 68088). 

17. A process for preparing a recombinant expression vector, comprising llgating bacterial, yeast or 
mammalian expression vector ONA and a ONA sequence encoding a human TNF-R protein sequence. 

18. A process for preparing a mammalian TNF-R or an analog thereof, comprising culturing a suitable host 
40 cell comprising a vector prepared according to claim 1 7 under conditions promoting expression. 

19. A process for detecting TNF or TNF-R protein molecules or the interaction thereof, comprising use of a 
mammalian TNF-R protein, a soluble TNF-R protein capable of binding TNF or substantially similar TNF-R 
analog produced by recombinant cell culture. 

20. A process for the preparation of antibodies immunoreactive with TNF receptor, the process comprising 
either (a) culturing a hybridoma cell expressing the antibodies and harvesting the antibodies, or (b) 
harvesting antibodies immunoreactive with TNF receptor from an appropriately immunised animal. 

Claims for the following Contracting State: GR 

50 1. An isolated ONA sequence encoding a biologically active mammalian TNF receptor (TNF-R) protein. 

2. An isolated ONA sequence according to claim i. selected from the group consisting of: 

(a) cONA clones having a nucleotide sequence derived from the coding region of a native mammalian 
TNF-R gene: 

(b) ONA sequences capable of hybridization to the clones of (a) under moderately stringent conditions 
55 (50* C, 2 X SSC) and which encode biologically active TNF-R protein: and 

(c) ONA sequences which are degenerate as a result of the genetic cede to the ONA sequences defined 
in (a) and (b) and which encode biologically active TNF-R protein. 

3. An isolated ONA sequence according to claim i which encodes a soluble human TNF-R protein. 



21 



EP 0 418 014 a: 

... , 

; .• • - : ; ' 

^^An isoiaiea.ONA sequencs accorc.ng :o cla.m 3. wherem the soluble human rNF-R orots.n ras an '.m,rn 
c,d secuencs compnsmg :ne seauence of amino acia residues i-x of Figure 2A .he ein . s J^'Z 
from the group consisting of amino acics 163-235 selected 

1 An isolated DNA sequence according to claim 3. wherein the soluble human TNF-a orotein comor.se. 
= the sequence of amino acids 1-235 of Figure 2A. comprises 

°Jn!.!? '° 3. wherein the soluble human TNF-R protein comcrises 

the sequence of ammo acids 1-185 of Figure 2A. cgmcnses 

■L ONA sequence according to claim 3. wherein the soluble human INF-fl protem comprises 

•.he sequence or ammo acids i -163 of Figure 2A. comprises 

.•0 8. A ONA sequence according to claim 3. wherein amino acid residue 46 is seisctert 'mm . . 

consisting of lie and Thr and amino acid residue i ,8 is selected f^om me ro^p cp "^g of d le ' 
^ A recombinant expression vector comprising a ONA sequence according to ly one of'ciaims , 7 

cauaiiZ^.Z^Z ' ""'""^ ^""^ ^^'^P'^' < ' ''^^ .=^°"ss comorising 

.5 polypeptide °' '° ^ ''^F-a 

11. A process according to claim 9. wherein the TNF-R protein is a soluble human TNF-R orotein 

12. A process according to claim n, wherein the soluble human INF-fl protein has an amino acid 
sequence comprising the sequence of amino acid residues i-x of Figure 2A. wherein x is selected from the 
group consisting of amino acids 163-235. 

JO 13. A process according to claim 11. wherein the soluble human TNF-H protein has an amio acid sequence 
which comprises the sequence of amino acid residues 1 -235 of Figure 2A 

14. A process according to claim n. wherein the soluble human TNF-R protein has an amio acid sequence 
which comprises the sequence of amino acid residues 1-185 of Rgure 2A 

15. A process according to claim 1 1. wherein the soluble human TNF-R protein has an am,o acid sequence 
25 which comprises the sequence of amino acid residues 1 -163 of Rgure 2A. 

16. The use of a mammalian TNF-R protein in preparing a medicament for regulating immune responses in 

nnammalS- 

17. The use of a mammalian TNF-R protein in preparing a pharmaceutical composition suitable for 
parenteral administration to a human patient for regulating immune responses. 

30 18. Antibodies immunoreactive with mammalian TNF receptors. 
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(2) INrORHATlCS "OR SEQ ID N0:1: ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1541 base pairs 

(B) TYPE: nucleic acid 

(C) StRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cONA to mRNA 

(iii) HYPOTHETICAL: M 

(iv) ANTX -SENSE: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(G) CELL TYPE: ribroblast 

(H) CELL LINE: WI-26 VA4 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: WI-2S VA4 

(B) CLONE: 1 

Ux) FEATURE: 

(Al NAME/KEY: CDS 

(B) LOCATION: 88.. 1473 

(D) OTHER INFORMATION: 
(ix) TEATURE: 

(A) NAME/KEY: mac^peptide 

(B) LOCATION: 38.. 1470 
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 38..1S3 
(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: Smith , Craig A. 

Davia, Terri 
Anderson, Dirk 
SoXam, Lisabeth 
3ec}unann, M. P. 
Jerzy, Rita 
Dower, Steven K. 
Cosman, David 
Goodwin, Elaytnond G. 

(B) TITLE: A Receptor for Tumor Necrosis Factor Defines an 

Unusual Family of Cellular and Viral Proteins 

(C) JOURNAL: Science 

(D) VOLUME: 248 

(F) PAGES: 1019-1023 

(G) DATE: 2S-MAY-1990 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

GCGAGGCAGG CAGCCTGGAG AGAAGGCGCT GGGCTGCGAG GGCGCGAGGG CGCGAGGGCA 50 

GGGGGCAACC GGACCCCGCC CGCATCC ATG GCG CCC GTC GCC GTC TGG GCC 111 

Met Ala Pro Val Ala Val Trp Ala 
1 5 

GCG CTG GCC GTC GGA CTG GAG CTC TGG GCT GCG GCG CAC GCC TTG CCC 159 
Ala Leu Ala Val Gly Leu Glu Leu Trp Ala Ala Ala His Ala Leu Pro 
10 15 20 
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GCC CAG GTG GCA tTT ACA CCC TAG GCC CCG GAG CCC GGG AGO ACA TGC ' 
Ala Gin vai Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser Thr Cya 
23 30 35 40 

CGG CTC AGA GAA TAC TAT GAC CAG ACA GCT CAG ATG TGC TGC AGC AAA 
Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys Cys Ser Lys 
45.. 50 55 

TGC TCG CCG GGC CAA CAT GCA AAA GTC TTC TGT ACC AAG ACC TCG GAC 
Cys Ser Pro Gly Gin His Ala Lys Val Phe Cys Thr Lys Thr Ser Aso 
60 65 70 

ACC GTG TGT -GAC TCC TGT GAG GAC AGC ACA TAC ACC CAG CTC TGG AAC 
Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu Trp Asn 
75 80 85 

TGG GTT CCC GAG TGC TTG AGC TGT GGC TCC CGC TGT AGC TCT GAC CAG 
Trp Val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Ser Asp Gin 
90 95 iOO 

GTG GAA ACT CAA GCC TGC ACT CGG GAA CAG AAC CGC ATC TGC ACC TGC 
Val Glu Thr Gin Ala Cys Thr Arg Glu Gin Asn Arg lie Cys Thr Cys 
105 110 115 120 

AGG CCC GGC TGG TAC TGC GCG CTG AGC AAG CAG GAG GGG TGC CGG CTG 
Arg Pro Gly Trp Tyr Cys Ala Leu Ser Lys Gin Glu Gly Cys Arg Leu 
125 130 135 

TGC GCG CCG CTG CGC' AAG TGC CGC CCG GGC TTC GGC GTG GCC AGA CCA 
Cys Ala Pro Leu Arg Lys Cys Arg Pro Gly Phe Gly Val Ala Arg Pro 
140 145 150 

GGA ACT GAA ACA TCA GAC GTG GTG TGC AAG CCC TGT GCC CCG GGG ACG 
Gly Thr Glu Thr Ser Asp Val Val Cys Lys Pro Cys Ala Pro Gly Thr 
155 160 165 

TTC TCC AAC ACG ACT TCA TCC ACG GAT ATT TGC AGG CCC CAC CAG ATC 
Phe Ser Asn Thr Thr Ser Ser Thr Asp lie Cys Arg Pro His Gin lie 
170 175 180 

TGT AAC GTG GTG GCC ATC CCT GGG AAT GCA AGC ATG GAT GCA GTC TGC 
Cys Asn Val Val Ala lie Pro Gly Asn Ala Ser Mec Asp Ala Val Cys 
135 190 195 200 

ACG TCC ACG TCC CCC ACC CGG ACT ATG GCC CCA GGG GCA GTA CAC TTA 
Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val His Leu 
205 210 215 

CCC CAG CCA GTG TCC ACA CGA TCC CAA CAC ACG CAG CCA ACT CCA GAA 
Pro Gin Pro Val Ser Thr Arg Ser Gin His Thr Gin Pro Thr Pro Glu 
220 225 230 

CCC AGC ACT GCT CCA AGC ACC TCC TTC CTG CTC CCA ATG GGC CCC AGC 
Pro Ser Thr Ala Pro Ser Thr Ser Phe Leu Leu Pro- Met Gly Pro Ser 
235 240 245 

CCC CCA GCT GAA GGG AGC ACT GGC GAC TTC GCT CTT CCA GTT GGA CTG 
Pro Pro Ala Glu Gly Ser Thr Gly Asp Phe Ala Leu Pro Val Gly Leu 
250 255 260 

ATT GTG GGT GTG ACA GCC TTG GGT CTA CTA ATA ATA GGA GTG GTG AAC 
lie Val Gly Val Thr Ala Leu Gly Leu Leu lie lie Gly Val Val Asn 
265 270 275 280 
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TGT GTC ATC ATG ACC CAG GTG AAA AAG AAG CCC TTG TGC CTG CAG AGA 97 5 

Cys Vai lie Met Thr Gin Vai Lys Lys Lys Pro Leu Cys Leu Gin Arg 
235 ZSO 295 

GAA GCC AAG GTG CCT CAC TTG CCT GCC GAT AAG GCC CGG GGT ACA CAG 1023 
Giu Aia Lys Vai Pro Hi3 Leu Pro Ala Aap Lys Ala Arg Giy Thr Gin 
300 305 310 

• GGC CCC GAG CAG CAG CAC CTG CTG ATC ACA GCG CCG AGC TCC AGC AGC 1071 
Gly Pro Glu Gin Gin His Leu Leu He Thr Ala Pro Ser Ser Ser Ser 
315 320 325 

AGC TCC CTG GAG AGC TCG GCC AGT GCG TTG GAC AGA AGG GCG CCC ACT 1119 
Ser Ser Leu Glu Ser Ser Ala Ser Ala Leu Asp Arg Arg Aia Pro Thr 
330 335 340 

CGG AAC CAG CCA CAG GCA CCA GGC GTG GAG GCC AGT GGG GCC GGG GAG 1157 
Arg Asn Gin Pro Gin Ala Pro Giy Vai Glu Ala Ser Gly Ala Gly Glu 
345 350 355 360 

GCC CGG GCC AGC ACC GGG AGC TCA GAT TCT TCC CCT GGT GGC CAT GGG 1215 
Ala Arg Ala Ser Thr Gly Ser Ser Asp Ser Ser Pro Gly Gly His Giy 
365 370 375 

ACC CAG GTC AAT GTC ACC TGC ATC GTG AAC GTC TGT AGC AGC TCT GAC 1263 
Thr Gin Vai Asn Vai Thr Cys He Vai Asn Vai Cys Ser Ser Ser Asp 
380 385 390 

CAC AGC TCA CAG TGC TCC TCC CAA GCC AGC TCC ACA ATG GGA GAC ACA 1311 
His Ser Ser Gin Cys Ser Ser Gin Ala Ser Ser Thr Met Giy Asp Thr 
395 400 405 

GAT TCC AGC CCC TCG GAG TCC CCG AAG GAC GAG CAG GTC CCC TTC TCC 1359 
Asp Ser Ser Pro Ser Glu Ser Pro Lys Asp Glu Gin Vai Pro Phe Ser 
410 415 420 

AAG GAG GAA TGT GCC TTT CGG TCA CAG CTG GAG ACG CCA GAG ACC CTG 1407 
Lys Glu Glu Cys Ala Phe Arg Ser Gin Leu Glu Thr Pro Glu Thr Leu 
425 430 435 440 

CTG GGG AGC ACC GAA GAG AAG CCC CTG CCC CTT GGA GTG CCT GAT GCT 14 55 

Leu Gly Ser Thr Glu Glu Lys Pro Leu Pro Leu Gly Vai Pro Asp Aia 
445 450 455 

GGG ATG AAG CCC AGT TAA CCAGGCCGGT GTGGGCTGTG TCGTAGCCAA 1503 
Gly Met Lys Pro Ser 
460 

GGTGGGCTGA GCCCTGGCAG GATGACCCTG CGAAGGGGCC CTGGTCCTTC CAGGCCCCCA 1563 

CCACTAGGAC TCTGAGGCTC TTTCTGGGCC AAGTTCCTCT AGTGCCCTCC ACAGCCGCAG 162 3 

CCTCCCTCTG ACCTGCAG 1641 



(2) INFOEWATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 62 axaino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Ala Pro Val Ala Val Trp Ala Ala Leu Ala Val Gly Leu Glu Leu 
15 10 15 

Trp Ala Ala Ala His Ala Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr 
20 25 30 

Ala Pro Glu Pro Gly Ser Thr Cya Arg Leu Arg Glu Tyr Tyr Aap Gin 
35 40 45 

Thr Ala Gin Met Cys Cya Ser Lys Cys Ser Pro Gly Gin His Ala Lya 
50 55 60 



Val Phe Cys Thr Lys Thr Ser Asp Thr Val Cys Asp Ser Cys Glu Asp 

65 70 75 80 

Ser Thr Tyr Thr Gin Leu Trp Asn Trp Val Pro Glu Cys Leu Ser Cys 
85 90 95 

Gly Ser Arg Cys Ser Ser Asp Gin Val Glu Thr Gin Ala Cys Thr Arg 
100 105 110 

Glu Gin Asn Arg lie Cys Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu 
115 120 125 

Ser Lys Gin Glu Gly Cys Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg 
130 135 140 

Pro Gly Phe Gly Val Ala Arg Pro Gly Thr Glu Thr Ser Asp Val Val 
145 150 155 160 

Cys Lys Pro Cys Ala Pro Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr 
165 170 175 

Asp He Cys Arg Pro His Gin He Cys Asn Val Val Ala He Pro Gly 
180 185 190 

Asn Ala Ser Met Asp Ala Val Cys Thr Ser Thr Ser Pro Thr Arg Ser 
195 200 205 

Met Ala Pro Gly Ala Val His Leu Pro Gin Pro Val Ser Thr Arg Ser 
210 215 220 

Gin His Thr Gin Pro Thr Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser 
225 230 235 240 

Phe Leu Leu Pro Met Gly Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly 
245 250 255 

Asp Phe Ala Leu Pro Val Gly Leu He Val Gly Val Thr Ala Leu Gly 
260 265 270 

Leu Leu He He Gly Val Val Asn Cys Val He Met Thr Gin Val Lys 
275 280 285 

Lys Lys Pro Leu Cys Leu Gin Arg Glu Ala Lys Val Pro His Leu Pro 
290 295 300 
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Aia Asp Ly3 Aia Ar? Giy Thr Gin Giy Pro Giu Gin Gin Kij Leu Le-i- 
305 310 315 320 

lie Thr Aia Pro Ser Ser Ser Ser Ser Ser Leu Giu Ser Ser Aia Ser 
325 330 335 

Aia Leu Aap Arg Arg Aia Pro Thr Arg Aan Gin Pro Gin Aia Pro Giy 
340 345 350 

Vai Giu Aia Ser Giy Aia Giy Giu Aia Arg Aia Ser Thr Giy Se- Ser 
355 360 365 

Asp Ser Ser Pro Giy Giy His Giy Thr Gin Vai Aan Vai Thr Cya lie 
370 375 380 

Vai Aan Vai Cys Ser Ser Ser Asp Hia Ser Ser Gin Cys Ser Ser Gin 
385 390 395 400 

Aia Ser Ser Thr Met Giy Asp Thr Aap Ser Ser Pro Ser Giu Ser Pro 
405 410 415 

Lys Asp Giu Gin Vai Pro Phe Ser Lys Giu Giu Cys Aia ?he Arg Ser 
420 425 430 

Gin Leu Giu Thr Pro Giu Thr Leu Leu Giy Ser Thr Giu Giu Lys Pro 
435 440 445 

Leu Pro Leu Giy Vai Pro Asp Aia Giy Met Lys .Pro Ser 
450 455 460 

(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 3813 base pairs 
(B) TYPE: nucleic acid 
K:) STRANDEDNESS : singie 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cONA to mRNA 

(iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: mouse 

(B) STRAIN: C57aL/6 

(G) CELL TYPE: T-heiper ceil 

(H) CELL LINE: 739 

(vii) IMMEDIATE SOURCE: 
{B) CLONE: 11 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 55.. 1479 
(D) OTHER INFORMATION: 

(ix) FEATURE: 

ik) NAME/KEY: matj^eptide 
(B) LOCATION: 55.. 1476 
(D) OTHER INFORMATION: 
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tlx) cSlATURE: 

(A) NAMX/KEY: aig^peptide 

(B) LOCATION: 55.Tl20 



(D) OTHiR INFORMATION: 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CGCAGCTGAG GCACTAGAGC TCCAGGCACA AGGGCGGGAG CCACCGCTGC CCCT ATG 

Met 

1 

GCG CCC GCC GCC CTC TGG GTC GCG CTG GTC TTC GAA CTG CAG CTG TGG 

Ala Pro Ala Aia Leu Trp Val Ala Leu VaJ. Phe Glu Leu Gin Leu Trp 

5 10 IS 

GCC ACC GGG CAC ACA GTG CCC GCC CAG GTT GTC TTG ACA CCC TAC AAA 

Ala Thr Gly His Thr Val Pro Ala Gin Val Val Leu Thr Pro Tyr Ly3 

20 25 30 

CCG GAA CCT GGG TAC GAG TGC CAG ATC TCA CAG GAA TAC TAT GAC AGG 

Pro Glu Pro Gly Tyr Glu Cya Gin lie Ser Gin Glu Tyr Tyr Asp Arg 
35 40 45 

AAG GCT CAG ATG TGC TGT GCT AAG TGT CCT CCT GGC CAA TAT GTG AAA 

Lys Ala Gin Met Cys Cys Ala Lys Cys Pro Pro Gly Gin Tyr Val Lys 

50 • 55 60 65 

CAT TTC TGC AAC AAG ACC TCG GAC ACC GTG TGT GCG GAC TGT GAG GCA 

His Phe Cys ksn Lys Thr Ser Asp Thr Val Cys Ala Asp Cys Glu Aia 

70 75 80 

AGC ATG TAT ACC CAG GTC TGG AAC CAG TTT CGT ACA TGT TTG AGC TGC 

Ser Met Tyr Thr Gin Val Trp Asn Gin Phe Arg Thr Cys Leu Ser Cys 

85 90 95 

AGT TCT TCC TGT ACC ACT GAC CAG GTG GAG ATC CGC GCC TGC ACT AAA 

Ser Ser Ser Cys Thr Thr Asp Gin Val Glu tie Arg Ala Cys Thr Lys 

100 105 110 

CAG CAG AAC CGA GTG TGT GCT TGC GAA GCT GGC AGG TAC TGC GCC TTG 

Gin Gin Asn Arg Vai Cys Ala Cys Glu Ala Gly Arg Tyr Cys Ala Leu 
115 120 125 

AAA ACC CAT TCT GGC AGC TGT CGA CAG TGC ATG AGG CTG AGC AAG TGC 

Lys Thr His Ser Gly Ser Cys Arg Gin Cys Met Arg Leu Ser Lys Cys 

130 135 140 145 

GGC CCT GGC TTC GGA GTG GCC AGT TCA AGA GCC CCA AAT GGA AAT GTG 

Gly Pro Gly Phe Gly Vai Aia Ser Ser Arg Ala Pro Asn Gly Asn Val 

150 155 160 

CTA TGC AAG GCC TGT GCC CCA GGG ACG TTC TCT GAC ACC ACA TCA TCC 

Leu Cys Lys Ala Cys Ala Pro Gly Thr Phe Ser Asp Thr Thr Ser Ser 

165 170 175 

ACT GAT GTG TGC AGG CCC CAC CGC ATC TGT AGC ATC CTG GCT ATT CCC 

Thr Asp Vai Cys Arg Pro His Arg lie Cys Ser lie Leu Ala lie Pro 

180 185 190 
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GGA AAT GCA AGC ACA GAT GCA GTC TGT GCG CCC GAG TCC CCA ACT CTA 
Giy Aan Aia Ser Thr Aap Ala Val Cys Ala ?ro Glu Se- Pro '^h^ Leu 
195 ^ 200 205 



631 



AGT GCC ATC CCA AOG ACA CTC TAC GTA TCT CAG CCA GAG CCC ACA AGA 7 29 

Ser Aia lie ?ro Arg Thr Leu Tyr Vai Ser Gin Pro Glu Pro Th- Ara 
2i0 215 220 " -.^c 



220 225 

TCC CAA CCC CTG GAT CAA GAG CCA GGG CCC AGC CAA ACT CCA AGC ATC 

Ser Gin Pro Leu Aap Gin Glu Pro Gly Pro Ser Gin Thr Pro Ser He 

230 235 240 



777 



CTT ACA TCG TTG GGT TCA ACC CCC ATT ATT GAA CAA AGT ACC AAG GGT 325 
Leu Thr Ser Leu Gly Ser Thr Pro lie lie Glu Gin Ser Thr Lya Gly 
245 250 255 

GGC ATC TCT CTT CCA ATT GGT CTG ATT GTT GGA GTG ACA TCA CTG GGT 87 3 

Gly He Ser Leu Pro lie Gly Leu lie Val Gly Vai Thr Ser Leu Gly 
260 265 270 

CTG CTG ATG TTA GGA CTG GTG AAC TGC ATC ATC CTG GTG CAG AGG AAA 921 
Leu Leu Met Leu Gly Leu Val Asn Cys lie lie Leu Vai Gin Arc Lys 
215 280 235 

AAG AAG CCC TCC TGC CTA CAA AGA GAT GCC AAG GTG CCT CAT GTG CCT 969 
Ly3 Lys Pro Ser Cys Leu Gin Arg Aap Ala Lys Val Pro His Val Pro 
290 - 295 300 305 

GAT GAG AAA TCC CAG GAT GCA GTA GGC CTT GAG CAG CAG CAC CTG TTG 1017 
Asp Glu Lys Ser Gin Asp Ala Vai Gly Leu Glu Gin Gin His Leu Leu 
310 315 320 

ACC ACA GCA CCC AGT TCC AGC AGC AGC TCC CTA GAG AGC TCA GCC AGC 1065 
Thr Thr Ala Pro Ser Ser Ser Ser Ser Ser Leu Glu Ser Ser Ala Ser 
325 330 335 

GCT C-GG GAC CGA AGG GCG CCC CCT GGG GGC CAT CCC CAA GCA AGA GTC 1113 
Ala Gly Asp Arg Arg Ala Pro Pro Gly Gly His Pro Gin Aia Arg Val 
340 345 350 

ATG GCG GAG -GCC CAA GGG TTT CAG GAG GCC CGT GCC AGC TCC AGG ATT 1161 
Met Ala Glu Ala Gin Gly ?he Gin Glu Ala Arg Ala Ser Ser Arg He 
355 360 365 

TCA GAT TCT TCC CAC GGA AGC CAC GGG ACC CAC GTC AAC GTC ACC TGC 1209 
Ser Asp Ser Ser His Gly Ser His Gly Thr His Val Asn Val Thr Cys 
370 375 380 385 

ATC GTG AAC GTC TGT AGC AGC TCT GAC CAC AGT TCT CAG TGC TCT TCC 1257 
lie Vai Asn Vai Cys Ser Ser Ser Asp His Ser Ser Gin Cys Ser Ser 
390 395 400 

CAA GCC AGC GCC ACA GTG GGA GAC CCA GAT GCC AAG CCC TCA GCG TCC 1305 
Gin Ala Ser Ala Thr val Gly Asp Pro Asp Ala Lys Pro Ser Ala Ser 
405 410 415 

CCA AAG GAT GAG CAG GTC CCC TTC TCT CAG GAG GAG TGT CCG TCT CAG 1353 
Pro Lys Asp Glu Gin Vai Pro Phe Ser Gin Glu Glu Cys Pro Ser Gin 
420 425 430 
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TCC CCG TGT GAG ACT ACA GAG ACA CTG GAG AGC CAT GAG AAG CCC TTG 1401 
Ser Pro Cys Glu Thr Thr Giu Thr Leu Gin Ser His Glu Lys Pro Leu 
435 440 445 

CCC CTT GGT GTG CCG GAT ATG GOC ATG AAG CCC AGC CAA GCT GGC TGG 1449 
Pro Leu Gly Vai Pro Aap Met Gly Met Lys Pro Ser Gin Ala Gly Trp ' 
450 455 460 465 

TTT GAT CAG ATT GCA GTC AAA GTG GCC TGA CCCCTGACAG GGGTAACACC 1499 
Phe Asp Gin lie Aia Vai Lys Vai Ala . 

470 475 

CTGCAAAGGG ACCCCCGAGA CCCTGAACCC ATGGAACTTC ATGACTTTTG CTGGATCCAT 1 = 59 

TTCCCTTAGT GGCTTCCAGA GCCCCAGTTG CAGGTCAAGT GAGGGCTGAG ACAGCTAGAG 1619 

TGGTCAAAAA CTGCCATGGT GTTTTATGGG GGCAGTCCCA GGAAGTTGTT GCTCTTCCAT 167 9 

GACCCCTCTG GATCTCCTGG GCTCTTGCCT GATTCTTGCT TCTGAGAGGC CCCAGTATTT 17 39 

TTTCCTTCTA AGGAGCTAAC ATCCTCTTCC ATGAATAGCA CAGCTCTTCA GCCTGAATGC 1799 

TGACACTGCA GGGCGGTTCC AGCAAGTAGG AGCAAGTGGT GGCCTGGTAG GGCACAGAGG 1959 

CCCTTCAGGT TAGTGCTAAA CTCTTAGGAA GTACCCTCTC CAAGCCCACC GAAATTCTTT 1919 

TGATGCAAGA ATCAGAGGCC CCATCAGGCA GAGTTGCTCT GTTATAG<3AT GGTAGGGCTG 1979 

TAACTCAGTG GTCCAGTGTG CTTTTAGCAT GCCCTGGGTT TGATCCTCAG CAACACATGC 2039 

AAAACGTAAG TAGACAGCAG ACAGCAGACA GCACAGCCAG CCCCCTGTGT GGTTTGCAGC 2099 

CTCTGCCTTT GACTTTTACT CTGGTGGGCA CACAGAGGGC TGGAGCTCCT CCTCCTGACC 2159 

TTCTAATGAG CCCTTCCAAG GCCACGCCTT CCTTCAGGGA ATCTCAGGGA CTGTAGAGTT 2219 

CCCAGGCCCC TGCAGCCACC TGTCTCTTCC TACCTCAGCC TGGAGCACTC CCTCTAACTC 2279 

CCCAACGGCT TGGTACTGTA CTTGCTGTGA CCCCAACGTG CATTGTCCGG GTTAGGCACT 2339 

GTGAGTTGGA ACAGCTCATG ACATCGGTTG AAAGGCCCAC CCGG&AACAG CTAAGCCAGC 2399 

TCTTTTGCCA AAGGATTCAT GCCGGTTTTC TAATCAACCT GCTCCCTAGC ATTGCCTGGA 24 59 

AGGAAAGGGT TCAGCy^GACT CCTCAAGAAG CAAGTTCAGT CTCAGGTGCT TGGATGCCAT 2519 

GCTCACCGAX TCCACTGGAT ATGAACTTGG CAGAGGAGCC TAGTTGTTGC CATGGAGACT 2579 

TAAAGAGCTC AGCACTCTGG AATCAAGATA CTGGACACTT GGGGCCGACT TGTTAAGGCT 2639 

CTGCAGCATC- AGACTGTAGA GGGGAAGGAA CACGTCTGCC CCCTGGTGGC CCGTCCTGGG 2699 

ATGACCTCGG GCCTCCTAGG CAACAAAAGA ATGAATTGGA AAGGATGTTC CTGGGTGTGG 2759 

CCTAGCTCCT GTGCTTGTGT GGATCCCTAA AGGGTGTGCT AAGGAGCAAT TGCACTGTGT 2819 

GCTGGACAGA AXTCCTGCTT ATAAATGCTT TTTGTTGTTG TTTTGTACAC TGAGCCCTGG 2879 

CTGAGCCACC CCACCCCACC TCCCATCCCA CCTTTACACG CCACTCTTGC ATGAGAACCT 2939 

GGCTGTCTCC CACTTGTAGC CTGTGGATGC TGAGGAAACA CCCAGCCAAG TAGACTCCAG 2999 

GCTTGCCCCT ATCTCCTGCT ATGAGTCTGG CCTCCTCATT GTGTTGTGGG AAGGAGACGG 3059 
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GTTCTGTCAf CTCGGAACGC CCACACCGTG GATGTGAACA ATGGCTGTAC TAGCTTAGAC 3ii9 

CAGCTTAGGG CTCTGCATAT CACAGGAGGG GGAGCAGGGA ACAATTTGAG TGCTGACCTA 3179 

TAACACAGTT CCTAAAGGAT CGGGCAGTCC AGAATCTCCT CCTTCAGTGT GTGTGTGTGT 3239 

GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT CCATGTTTGC ATGTATGTGT GTGCCAGTGT 3299 

■GTGGAGGCCC GAGGTTGGCT TTGGGTGTGT TTGATCACTC TCCAGTTACT GAGGCGGGCT 3 359 

CTCATCTGTA CCCAGAGCTT GCACATTTTC TAGTCTAACT TGATTCAGGG ATCTCTGTCT 3419 

GCCTATGGAG GTGCTCAGGT TACAGGCAGG CTGCCATACC TGCCCGACAT TTACATGAAT 3479 

ACTAGAGATC TGAATTCTGG TCCTCACACT TGTATACCTG CATTTTATCC ACTAAGACAT 3539 

CTCTCCAAGG GCTCCCCCTT CCTATTTAAT AAGTTAGTTT TGAACTGGCA AGATGGCTCA 3599 

GTGGGTAAGG CAGTTTGCGG ACAAACCTGA TGACCTGAGT TGGATCCCTG ACCATAAGGT 3659 

AGAAGAGACC TGATTCCTGC AAGTTGTCCT CTGACCACCA CCCCATACAT GCTTCTGCAT 3719 

ATGTGCACAC ATCACATTCT TGCACACACA CTCACATACC ATAAATGTAA TAAATTTTTT 3779 

TAAATAAATT GATTTTATCT TTTAAAAAAA AAAA 3813 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 475 amino acida 
(3) TYPE: amino acid 
(D) TOPOLO<JY: iinear 

(ii) MOLECULE TYPE: procein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Aia Pro Ala Aia Leu Trp Vai Ala Leu Vai Phe Glu Leu Gin Leu 
i 5 10 15 

Trp Aia Thr Gly Hia Thr Val Pro Ala Gin Vai Vai Leu Thr Pro Tyr 
20 25 30 

Lya Pro Glu Pro Gly Tyr Glu Cya Gin lie Ser Gin Glu Tyr Tyr Asp 
35 40 45 

Arg Lya Ala Gin Met Cya Cya Ala Lya Cya Pro Pro Gly Gin Tyr Vai 
50 55 60 

Lya Hia Phe Cya Aan Lya Thr Ser Aap Thr Vai Cya Ala Aap Cya Glu 
65 70 75 80 

Ala Ser Met Tyr Thr Gin Val Trp Aan Gin Phe Arg Thr Cya Leu Ser 
85 90 . 95 

Cya Ser Ser Ser Cya Thr Thr Aap Gin Val Glu lie Arg Ala Cys Thr 
100 105 110 

Lys Gin Gin Aan Arg Vai Cys Aia Cya Glu Ala Gly Arg Tyr Cys Ala 
115 120 125 
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Leu Lys Thr Hia Ser 
L30 



Gly Ser Cya 
135 



Arg Gin Cys Met 
140 



Arg Leu 



S3r Ly* 



Cy3 Gly Pro Gly Phe Gly Val Ala Ser Ser Arg Ala Pro Asn Gly Aan 
145 150 155 160 

Val Leu Cya Lya Ala Cya Ala Pro Gly Thr Phe Ser Aap Thr Thr Ser 
165 170 175 

Ser Thr Aap Val Cya Arg Pro Hia Arg lie Cya Ser lie Leu Ala He 
180 185 190 

Pro Gly Aan Ala Ser Thr Aap Ala Val Cya Ala Pro Glu Ser Pro Thr 
195 200 205 

Leu Ser Ala He Pro Arg Thr Leu Tyr Val Ser Gin Pro Glu Pro Thr 
210 215 220 

Arg Ser Gin Pro Leu Aap Gin Glu Pro Gly Pro Ser Gin Thr Pro Ser 
225 230 235 240 

He Leu Thr Ser Leu Gly Ser Thr Pro He He Glu Gin Ser Thr Lya 
245 250 255 

Gly Gly He Ser Leu Pro He Gly Leu He Val Gly Val Thr Ser Leu 
260 265 270 

Gly Leu Leu Met Leu Gly Leu Val Asn Cya He II Leu Val Gin Arg 
275 280 285 

Lya Lya Lya Pro Ser Cya Leu Gin Arg Aap Ala Lya Val Pro His Val 
290 295 300 

Pro Aap Glu Lya Ser Gin Aap Ala Val Gly Leu Glu Gin Gin His Leu 
305 310 315 320 

Leu Thr Thr Ala Pro Ser Ser Ser Ser Ser Ser Leu Glu Ser Ser Ala 
325 330 335 

Ser Ala Gly Aap Arg Arg Ala Pro Pro Gly Gly His Pro Gin Ala Arg 
340 345 350 

Val Met Ala Glu Ala Gin Gly Phe Gin Glu Ala Arg Ala Ser Ser Arg 
.355 360 365 

He Ser Asp Ser Ser His Gly Ser His Gly Thr Hia Val Asn Val Thr 
370 375 380 

Cys He Val Asn Val Cys Ser Ser Ser Asp His Ser Ser Gin Cys Ser 
335 390 395 400 

Ser Gin Ala Ser Ala Thr Val Gly Asp Pro Asp Ala Lys Pro Ser Ala 
405 410 415 

Ser Pro Lys Asp Glu Gin Val Pro Phe Ser Gin Glu Glu Cys Pro Ser 
420 425 430 

Gin Ser Pro Cys Glu Thr Thr Glu Thr Leu Gin Ser His Glu Lys Pro 
435 440 445 

Leu Pro Leu Gly Val Pro Asp Met Gly Met Lys Pro Ser Gin Ala Gly 
450 455 460 
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Trp Phe Aap Gin lie Ala Val Lya Val- Ala 
465 470 4 



33 



EP 0 418 014 A1 



\yA 1 — . 

1 H^THT-a 

^ 1 

' HuTNT-lAiea 

I I M««r* 
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GCGAGGCAGGCAGCCTGGAGAGAAGGCG 2 8 

CTGGGCTGCC^GGGCGCGAGGGCGCGAGGGCAGGGGGCAACCGGACCCCGCCCGCATCC 8 7 



ATG 


GCG 


CCC 


GTC 


GCC 


GTC 


TGG 


GCC 


GCG 


CTG 


GCC 


GTC GGA 


CTG 


GAG 


132 


Met 


Ala 


Pro 


Val 


Ala 


Val 


Trp 


AlaxAla 


Leu 


Ala 


Val Gly 


Leu 


Glu 


-3 


CTC 


TGG 


GCT 


GCG 


GCG 


CAC 


GCC 


TTG 


CCC 


GCC 


CAG 


GTG GCA 


TTT 


ACA 


177 


Leu 


Trp 


Ala 


Ala 


Ala 


His 


Ala 


Leu 


Pro 


Ala 


Gin 


Val Ala 


Phe 


Thr 


a 


ccc 


TAC 


GCC 


CCG 


GAG 


CCC 


GGG Srac 


ACA 


TGC 


CGG 


CTC AGA 


GAA 


TAC 


222 


Pro 


Tyr 


Ala 


Pro 


Glu 


Pro 


Gly 


Ser 


Thr 


Cys 


Arg 


Leu Arg 


Glu 


Tyr 


23 


TAT 


GAC 


CAG 


ACA 


GCT 


CAG 


ATG 


TGC 


TGC 


AGC 


AAA 


TGC TCG 


CCG 


GGC 


267 


Tyr 


Aap 


Gin 


Thr 


Ala 


Gin 


Met 


Cys 


Cys 


Ser 


Lys 


Cys Ser 


Pro 


Gly 


38 


CAA 


CAT 


GCA 


AAA 


GTC 


TTC 


TGT 


ACC 


AAG 


ACC 


TCG 


GAC ACC 


GTG 


TGT 


312 


Gin 


Hi3 


Ala 


Lys 


Val 


Phe 


Cys 


Thr 


Lys 


Thr 


Ser 


Asp Thr 


Val 


Cys 


53 


GAC 


TCC 


TGT 


GAG 


GAC 


AGC 


ACA 


TAC 


ACC 


CAG 


CTC 


TGG AAC 


TGG 


GTT 


357 


Asp 


Ser 


Cys 


Glu 


Asp 


Ser 


Thr 


Tyr 


Thr 


Gin 


Leu 


Trp Asn 


Trp 


Val 


68 


CCC 


GAG 


TGC 


TTG 


AGC 


TGT 


GGC 


TCC 


CGC 


TGT 


AGC 


TCT GAC 


CAG 


GTG 


402 


Pro 


Glu 


Cya 


Leu 


Ser 


Cys 


Gly 


Ser 


Arg 


Cys 


Ser 


Ser Asp 


Gin 


Val 


83 


GAA 


AC? 


CAA 


GCC 


TGC 


ACT 


CGG 


GAA 


CAG 


AAC 


CGC 


ATC TGC 


ACC 


TGC 


447 


Qlu 


Thr 


Gin 


Ala 


Cys 


Thr 


Arg 


Glu 


Gin 


Asn 


Arg 


lie Cys 


Thr 


Cys 


98 


AGG 


CCC 


GGC 


TGG 


TAC 


TGC 


GCG 


CTG 


AGC 


AAG 


CAG 


GAG GGG 


TGC 


CGG 


492 


Arg 


Pro 


Gly 


Trp 


Tyr 


Cys 


Ala 


Leu 


Ser 


Lys 


Gin 


Glu Gly 


Cys 


Arg 


113 


CTG 


TGC 


GCG 


CCG 


CTG 


CGC 


AAG 


TGC 


CGC 


CCG 


GGC 


TTC GGC 


GTG 


GCC 


537 


Leu 


Cys 


Ala 


Pro 


Leu 


Arg 


Lys 


Cys 


Arg 


Pro 


Gly 


Phe Gly 


Val 


Ala 


123 


AGA 


CCX 


GGA 


ACT 


GAA 


ACA 


TCA 


GAC 


GTG 


GTG 


TGC 


AAG CCC 


TGT 


GCC 


532 


Arg 


Pro 


Gly 


Thr 


Glu 


Thr 


Ser 


Asp 


Val 


Val 


Cys 


Lys Pro 


Cys Ala 

t 


143 


CCG 


GGG 


ACG 


TTC 


TCC 


AAC 


ACG 


ACT 


TCA 


TCC 


ACG 


GAT ATT 


TGC 


AGG 


627 


Pro 


Gly 


Thr 


Phe 


Ser 


Asn 


Thr 


Thr 


Ser 


Ser 


Thr 


Asp lie 


Cys 


Arg 


153 


CCC 


CAC 


CAG 


ATC 


TGT 


AAC 


GTG 


GTG 


GCC 


AXw 






GCA 


AGC 




Pro 


His 


Gin 


Ila 


Cys 


Asn 


Val 


Val 


Ala 


lie 


Pro 


Gly Asn 


Ala 


Ser 


173 


AJS 


GAT 


GCA 


GTC 


TGC 


ACG 


TCC 


ACG 


TCC 


CCC 


ACC 


CGG AGT 


ATG 


GCC 


717 


Meg 


Aap 


Ala 


Val 


Cys 


Thr 


Ser 


Thr 


Ser 


Pro 


Thr 


Arg Ser 

r 


Met 


Ala 


183 


CCA 


GGG 


GCA 


GTA 


CAC 


TTA 


CCC 


CAG 


CCA 


GTG 


TCC 


ACA CGA 


TCC 


CAA 


762 


Pro 


Gly 


Ala 


Val 


His 


Leu 


Pro 


Gin 


Pro 


Val 


Ser 


Thr Arg 


Ser 


Gin 


203 


CAC 


ACG 


CAG 


CCA 


ACT 


CCA 


GAA 


CCC 


AGC 


ACT 


GCT 


CCA AGC 


ACC 


TCC 


307 


His 


Thr 


Gin 


Pro 


Thr 


Pro 


Glu 


Pro 


Ser 


Thr 


Ala 


Pro Ser 


Thr 


Ser 


213 


TTC 


CTG 


CTC 


CCA 


ATG 


GGC 


CCC 


AGC 


CCC 


CCA 


GCT 


GAA GGG 


AGC 


ACT 


352 


Phe 


Leu 


Leu 


Pro 


Met Gly 


Pro 


Ser 


Pro 


Pro 


Ala 


Glu Gly 


Ser 


Thr 


233 


GGC 


GAC 


'ttc 


GCT 


CTT 


CCA 


GTT 


GGA 


CTG 


ATT 


GTG 


GGT GTG 


ACA 


GCC 


397 


Gly 


Aap 


2h&. 
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J^Ai. 


Gly Yal 




Ma 


243 
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987 
278 



TTG GGT CTA CTA ATA ATA GGA GTG GTG AAC TGT GTC ATC ATG ACC 94 2 

Irf^U GIV T^U I^n r.^^, V ^ l y^^ y^^ ^ 

CAG GTG AAA AAG AAG CCC TTG TGC CTG GAG AGA GAA GCC AAG GTG 
G i n Yfl] . Lya Lya Lys Pro Leu Cys L«u Gin Axg Giu AJLa Lys Vai 

CCT CAC TTG CCT GCC GAT AAG GCC CGG GGT ACA CAG GGC CCC GAG 1032 

Pro Hia Leu Pro Ala Aap Lya Ala Arg Gly Thr Gin Gly Pro Glu 2 93 

CAG CAG CAC CTG CTG ATC ACA GCG CCG AGC TCC AGC AGC AGC TCC 1077 

Gin Gin Hia Leu Leu He Thr Ala Pro Ser Ser Ser Ser Ser Ser 308 

CTG GAG AGC TCG GCC AGT GCG TTG GAC AGA AGG GCG CCC ACT CGG 1122 

Leu Glu Ser Ser Ala Ser Ala Leu Aap Arg Arg Ala Pro Thr Arg 323 

AAC CAG CCA CAG GCA CCA GGC GTG GAG GCC AGT GGG GCC GGG GAG 1167 

Aan Gin Pro Gin Ala Pro Gly Val Glu Ala Ser Gly Ala Gly Glu 338 

GCC CGG GCC AGC ACC GGG AGC TCA GAT TCT TCC CCT GGT GGC CAT 1212 

Ala Arg Ala Ser Thr Gly Ser Ser Aap Ser Ser Pro Gly Gly Hia 353 

GGG ACC CAG GTC AAT GTC ACC TGC ATC GTG AAC GTC TGT AGC AGC 1257 

Gly Thr Gin Val Aan Val Thr Cya He Val Asn Val Cys Ser Ser 368 

TCT GAC CAC AGC TCA CAG TGC TCC TCC CAA GCC AGC TCC ACA ATG 1302 

Ser Asp Hia Ser Ser Gin Cys Ser Ser Gin Ala Ser Ser Thr Met 38 3 

GGA GAC ACA GAT TCC AGC CCC TCG GAG TCC CCG AAG GAC GAG CAG 1347 

Gly Aap Thr Asp Ser Ser Pro Ser Glu Ser Pro Lya Aap Glu Gin 398 

GTC CCC TTC TCC AAG GAG GAA TGT GCC TTT CGG TCA CAG CTG GAG 1392 

Val Pro Phe Ser Lys Glu Glu Cya Ala Phe Arg Ser Gin Leu Glu 413 

ACG CCA GAG ACC CTG CTG GGG AGC ACC GAA GAG AAG CCC CTG CCC 14 37 

Thr Pro Glu Thr Leu Leu Gly Ser Thr Glu Glu Lya Pro Leu Pro 428 

CTT GGA GTG CCT GAT GCT GGG ATG AAG CCC AGT 1470 

Leu Gly Val Pro Aap Ala Gly Met Lya Pro Ser 439 

TAACCAGGCCGGTGTGGGCTGTGTCGTAGCCAAGGTGGGCTGAGCCCTGGCAGGATGAC 
CCTGCGAAGGGGCCCTGGTCCTTCCAGGCCCCCACCACTAGGACTCTGAGGCTCTTTCT 
GGGCCAAGTTCCTCTAGTGCCCTCCACAGCCGCAGCCTCCCTCTGACCTGCAG. . , 
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CGCAGCTGAGGCACTAGAGCTCC 


23 


AGGCACAAGGGCGGGAGCCACCGCTGCCCCT 


ATG 
Met 


GCG 
Ala 


CCC 
Pro 


GCC GCC 
Ala Ala 


CTC 
Leu 


TGG 
Trp 


75 
-16 


GTC 
Val 


GCG 
Ala 


CTG 
L>eu 


GTC 
Val 


TTC 
Phe 


GAA 

Glu 


CTG 
Leu 


CAG 
Gin 


CTG 
Leu 


TGG GCC 
Trp Ala 


ACC GGG 
Thr Gly 


CAC 
His 


ACA 

Thr 


120 


GTG CCC 
ial Pro 


GCC 
Ale 


CAG 
Gin 


GTT 
Val 


GTC 
Val 


TTG 
Leu 


ACA 

Thr 


CCC 
Pro 


TAC 

Tyr 


AAA 

Lys 


CCG GAA 
Pro Glu 


CCT 
Pro 


GGG 
Gly 


165 
15 


TAG 
Tyr 


GAG 

Glu 


TGC 
Cys 


CAG 


ATC 
He 


TCA 
Ser 


CAG 

Gin 


GAA 

Glu 


TAC TAT GAC AGG AAG GCT CAG 
Tyr Tyr Asp Arg Lys Ala Gin 


210 
30 


ATG 
Met 


TGC 
Cys 


TGT 
Cys 


GCT 
Ala 


AAG 
Lys 


TGT 
Cys 


CCT 
Pro 


CCT GGC CAA 
Pro Gly Gin 


TAT GTG AAA 
Tyr Val Lys 


CAT 
His 


TTC 
Phe 


255 
45 


TGC 
Cys 


AAC 
Asn 


AAG 

Lys 


ACC 
Thr 


TCG GAC ACC GTG 
Ser Asp Thr Val 


TGT GCG 
Cys Ala 


GAC 
Asp 


TGT GAG 
Cys Glu 


GCA 
Ala 


AGC 
Ser 


300 
60 


ATG 
Met 


TAT 
Tyr 


ACC 
Thr 


CAG 
Gin 


GTC 
Val 


TGG AAC 
Trp Asn 


CAG 

Gin 


TTT 
Phe 


CGT 
Arg 


ACA 

Thr 


TGT TTG 
Cys Leu 


AGC 
Ser 


TGC 
Cys 


345 
75 


AGT 
Ser 


TCT 
Ser 


TCC 
Ser 


TGT 
Cys 


ACC 
Thr 


ACT 
Thr 


GAC 
Asp 


CAG 
Gin 


GTG 
Val 


GAG 
Glu 


ATC 
lie 


CGC GCC 
Arg Ala 


TGC 
Cys 


ACT 
Thr 


390 
90 


AAA GAG GAG AAC CGA GTG TGT GOT TGC GAA 
Lys Gin Gin A^n Ar? Val Cys Ala Cys Glu 


GCT 
Ala 


GGC AGG TAC TgC 
Gly Arg Tyr Cys 


435 
105 


GCC 
Ala 


TTG 
Leu 


AAA 

Lys 


ACC 
Thr 


CAT 
His 


TCT GGC 
Ser Gly 


AGC 
Ser 


TGT 
Cys 


CGA CAG 

Arg Gin 


TGC ATG 
Cys Met 


AGG 
Arg 


CTG 
Leu 


430 
120 


AGC 
Ser 


AAG 
Lys 


TGC 
Cys 


GGC 
Gly 


CCT 
Pro 


GGC 
Gly 


TTC GGA GTG GCC 
Phe Gly Val Ala 


AGT 
Ser 


TCA AGA GCC 
Ser Arg Ala 


CCA 
Pro 


525 
135 


AAT 
Asn 


GGA AAT GTG 
Gly Asn Val 


CTA TGC 
Leu Cys 


AAG 
Lys 


GCC 
Ala 


TGT 
Cys 


GCC 
Ala 


CCA GGG ACG 
Pro Gly Thr 


TTC 
Phe 


TCT 
Ser 


570 
150 


GAC 
Asp 


ACC 
Thr 


ACA 
Thr 


TCA 
Ser 


TCC 
Ser 


ACT GAT GTG 
Thr Asp Val 


TGC 
Cys 


AGG 
Arg 


CCC 
Pro 


CAC CGC 
His Arg 


ATC TGT 
He Cys 


615 
165 


AGC 
Ser 


ATC 
lie 


CTG 
Leu 


GCT 
Ala 


ATT CCC GGA AAT GCA 
lie Pro Gly Asn Ala 


AGC ACA GAT GCA GTC TGT 
Ser Thr Asp Ala Val Cys 


660 
180 


GCG 
Ala 


CCC 
Pro 


GAG 

Glu 


TCC 
Ser 


CCA 
Pro 


ACT 
Thr 


CTA 
Leu 


AGT 
Ser 


GCC 
Ala 


ATC 
lie 


CCA 
Pro 


AGG ACA 
Arg Thr 


CTC 
Leu 


TAC 
Tyr 


705 
195 


GTA 
Val 


TCT 
Ser 


CAG 

Gin 


CCA 
Pro 


GAG 

Glu 


CCC 
Pro 


ACA AGA 
Thr Arg 


TCC 
Ser 


CAA 

Gin 


CCC 
Pro 


CTG GAT 
Leu Asp 


CAA 

Gin 


GAG 

Glu 


750 
210 


CCA 
Pro 


GGG CCC 
Gly Pro 


AGC 
Ser 


CAA 

Gin 


ACT 
Thr 


CCA 
Pro 


AGC 
Ser 


ATC 
lie 


CTT 
Leu 


ACA 
Thr 


TCG TTG 
Ser Leu 


GGT 
Gly 


TCA 
Ser 


795 
225 


ACC 
Thr 


CCC 
Pro 


ATT 
lie 


ATT 
He 


GAA 

Glu 


CAA 

Gin 


AGT 
Ser 


ACC 
Thr 


AAG 


GGT 

Glv 


GGC 
Glv 


ATC TCT 

lie Sfic 


CTT 

Lau 


CCA 


840 
240 


ATT 
lift 


GGT 

Gly 


CTG 


ATT 
JLLfi. 


GTT 


GGA 

Gly 


GTG 
Val 


ACA 


TCA 


CTG GGT 

Leu gly 


CTG CTG 

L#u Leu 


ATG 


TTA 


885 
255 
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GGA C7G GTG 

Gly L^u Val 


AAC 


TGC 


ATC 

ri« 


ATC 

rif 


CTG 


GTG CAG 


AGG 
Arg 

CAT 

HJ.3 


AAA 
Lys 

GTG 
Vai 


AAG 
Lys 


AAG 
Lys 


CCC 
Pro 


930 
270 


TCC TGC CTA 
Ser Cya L«u 


CAA 

Gin 


AGA GAT GCC 
Arg \sp Ala 


AAG 


GTG 
Vai 


CCT 
Pro 


CCT 
Pro 


GAT 
Asp 


GAG 
Glu 


975 
285 


AAA TCC CAG 
Lya S«r Gin 


GAT 
Aap 


GCA 
Ala 


GTA 
Vai 


GGC 
Gly 


CTT 
Leu 


GAG 

Giu 


CAG 
Gin 


CAG 
Gin 


CAC 
His 


CTG 
Leu 


TTG 
Leu 


ACC 
Thr 


1020 
300 


ACA GCA CCC 
Thr Ala Pro 


AGT 
Ser 


TCC 
Ser 


AGC 
Ser 


AGC 
Ser 


AGC 
Ser 


TCC 
Ser 


CTA 
Leu 


GAG 
Giu 


AGC 
Ser 


TCA 
Ser 


GCC 
Aia 


AGC 
Ser 


1065 
315 


GCT GGG GAC CGA AGG GCG 
Ala Gly Asp Krg Arg Ala 


CCC 
Pro 


CCT 
Pro 


GGG GGC CAT 
Giy Gly His 


CCC 
Pro 


CAA 

Gin 


GCA 
Ala 


AGA 
Arg 


1110 
330 


GTw ATG GCG 
Vai Met Ala 


GAG 
Glu 


GCC 
Ala 


CAA GGG TTT CAG GAG GCC CGT GCC 
Gin Gly Phe Gin Glu Ala Arg Aia 


AGC 
Ser 


TCC 
Ser 


1155 
345 


AGG ATT TCA 
Arg lie Ser 


GAT 
Ksp 


TCT 
Ser 


TCC 
Ser 


CAC 
Hi3 


GGA 
Gly 


AGC 
Ser 


CAC 
His 


GGG ACC 
Giy Thr 


CAC 
His 


GTC 
Vai 


AAC 
Asn 


1200 
360 


GTC ACC TGC 
Vai Thr Cy5 


ATC 
He 


GTG 
Vai 


AAC 
Aan 


GTC 
Vai 


TGT 
Cys 


AGC 
Ser 


AGC 
Ser 


TCT 
Ser 


GAC 
Asp 


CAC 
His 


AGT 
Ser 


TCT 
Ser 


1245 
375 


CAG TGC TCT 
Gin Cys Ser 


TCC 
Ser 


CAA 

Gin 


GCC 
Aia 


AGC 
Ser 


GCC 
Aia 


ACA 

Thr 


GTG 
Vai 


GGA GAC CCA GAT GCC 
Giy Asp Pro Asp Aia 


1290 
390 


AAG CCC TCA 
Lya Pro Ser 


GCG 
Ala 


TCC 
Ser 


CCA 
Pro 


AAG 
Lys 


GAT GAG 
Asp Giu 


CAG 
Gin 


GTC 
Vai 


CCC 
Pro 


TTC 
Phe 


TCT 
Ser 


CAG 
Gin 


1335 
405 


GAG GAG TGT 
Glu Glu Cys 


CCG 
Pro 


TCT 
Ser 


CAG 
Gin 


TCC 
Ser 


CCG 
Pro 


TGT GAG 
Cys Giu 


ACT 
Thr 


ACA 
Thr 


GAG 
Giu 


ACA 
Thr 


CTG 
Leu 


1380 
420 


CAG AGC CAT 
Gin Ser His 


GAG 
Glu 


AAG 
Lys 


CCC 
Pro 


TTG 
Leu 


CCC 
Pro 


CTT GGT GTG CCG GAT ATG 
Leu Giy Vai Pro Asp Met 


GGC 
Giy 


1425 
435 


ATG AAG CCC 
Met Lys Pro 


AGC CAA GCT GGC TGG TTT GAT CAG 
Ser Gin Ala Gly Trp Phe Asp Gin 


ATT 

lie 


GCA 
Aia 


GTC 
Vai 


AAA 

Lys 


1470 
450 


GTG GCC 
Vai Ala 


























1476 
452 



TGACCCCTGACAGGGGTAACACCCTGCAAAGGGACCCCCGAGACCCTGAACCCATGGAAC 1536 

TTCATGACTTTTGCTGGATCCArTTCCCTTAGTGGCTTCCAGAGCCCCAGTTGCAGGTCA 1596 

AGTGAGGGCTGAGACAGCTAGAGTGGTCAAAAACTGCCATGGTGTTTTATGGGGGCAGTC 1656 

CCAGGAAGTTGTTGCTCTTCCATGACCCCTCTGGATCTCCTGGGCTCTTGCCTGATTCTT 1716 

GCTTCTGAGAGGCCCCAGTATTTTTTCCTTCTAAGGAGCTAACATCCTCTTCCATGAATA 1776 

GCACAGCTCTTCAGCCTGAATGCTGACACTGCAGGGCGGTTCCAGCAAGTAGGAGCAAGT 1836 

GGTGGCCTGCTAGGGCACAGAGGCCCTTCAGGTTAGTGCTAAACTCTTAGGAAGTACCCT 1896 

CTCCAAGCCCACCGAAATTCTTTTGATGCAAGAATCAGAGGCCCCATCAGGCAGAGTTGC 1956 

TCTGTTATAGGATGGTAGGGCTGTAACTCAgTGGTCCAgTGTGCTTTTAGCATGCCCTGG 2016 

GTTTGATCCTCAGCAACACATGCAAAACGTAAGTAGACAGCAGACAGCAGACAGCACAGC 2076 

CAGCCCCCTGTGTGGTTTGCAGCCTCTGCCTTTGACTTTTACTCTGGTGGGCACACAGAG 2136 

GGCTGGAGCTCCTCCTCCTGACCTTCTAATGAGCCCTTCCAAGGCCACGCCTTCCTTCAG 2196 

GGAATCTCAGGGACTGTAGAGTTCCCAGGCCCCTGCAGCCACCTGTCTCTTCCTACCTCA 2256 

GCCTGGAGCAcTCCCTCTAACTCCCCAACGgCTTGGTACTGTACTTGCTGTGACCCCAAC 2316 

GTGCATTGTCCGGGTTAGGCACTGTGAGTTGGAACAGCTcATGACATCGGTTGAAAGGCC 2376 

CACCCGGAAACAGCTAAGCCAGCTCTTTTGCCAAAGGATTCATGCCGGTTTTCTAATCAa 2436 

CCTGCTCCCTAGCATTGCCTGGAAGGAAAGGGTTCAGGAGACTCCTCAAGAAGCAAGTTC 2496 

AGTCTCAGGTGCTTGGATGCCATGCTCACCGATTCCACTGGATATGAACTTGGCAGAGGA 2556 
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GCCTAGTTGTTGCCATGGAGACTTAAAGAGCTCAGCACTCTGCSAATCAAGATACTGGACA 2616 

CTTGGGGCCGACTTGTTAAGGCTCTGCAGCATCAGACTGTAGAGGGGAAGGAACACGTCT 267 6 

GCCCCCTGGTGGCCCGTCCTGGGAtGACCTCGGGCCtCCTAGGCAACAAAAGAATGAATT 2736 

GGAAAGGATGTTCCTGGGTGTGGCCTAGCTCCTGTGCTTGTGTGGATCCCTAAAGGGTGT 2796 

GCTAAGGAGCAATTGCACTGTGTGCTGGACAGAATTCCTGCTTATAAATGCTTTTTGTTG 29 5 6 

TTGXTTTGTACACTGAGCCCTGGCTGAGCCACCCCJ^CCCCACCTCCCATCCCACCTTTAC 2916 

ACGCCACTCTTGCATGAGAACCTGGCTGTCTCCCXCTTGTAGCCTGTGGATGCTGAGGAA 297 6 

ACACCCAGCCXAGTAGACrCCAGGCTTgCCCCTATCTCCTGcTaTGAGTctTOCCTCCTC 3036 

AtTgTGTTGTGGGAAgGAGACGGGtTCTGTCATCTCGGAAcgCCCACACCGTGGATGTGA 3096 

ACAaTGGCTGTACTAGCTTAGACCAqCrTAGGGCTCTGCATATCACAGGAGGGGGAGCAG 3156 

GGAACAATTTGAGTGCTGACCTATAACACAgTTCCTAAAGGATCGGGCAGTCCAGAATCT 3216 

CCTCCTTCAGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCCATGTT 327 6 

TGCATGTATGTGTGTGCCAGTGTGTGGAGGCCCGAGGTTGGCTTTGGGTGTGTTTGATCA 3336 

CTCTCCAGTTACTGAGGCGGGCTCTCATCTGTACCCAGAGCTTGCACArrrTCTAGTCTA 3396 

ACTTGATTCAGGGATCTCTGTCTGCCTATGGAGgTGCTCAGGTTACAGGCAGGCTGCCAT 3456 

ACCTGCCCGACATTTACATGAATACTAGAGATCTGAATTCrGGTCCTCACACTTGTATAC 3516 

CTGGATTTTATCCACTAAGACATCTCTCCAAGGGCTCCCCCTTCCTATTTAATAAGTTAG 3576 

TTTTGAACTGGCAAGATGGCTCAGTGGGTAAGGCAGTTTGCGGACAAACCTGATGACCTG 3636 

AGTTGGATCCCTGACCATAAGGTAGAAGAGACCTGATTCCTGCAAGTTGTCCTCTGACCA 3 696 

CCACCCCATACATGCTTCTGCATATGTGCACACATCACATTCTTGCACACACACTCACAT 3756 

ACCATAAATGTAATAAATTTTTTTAAATAAAT TGATTTTATCtrTTAAAAAAAAAAA 3 813 
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